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ACQUTRED RESISTANCE NPR GENES AND USES THEREOF 

Background of the Invention 
5 This invention relates to the fields of genetic engineering, plant biology, plant pathogen 
defense genes and their proteins, and crop protection. 

Recent advances in plant pathology have provided a basis for understanding the 
cellular and molecular genetic mechanisms by which plants defend themselves against 
pathogen attack. In particular, plants are known to utilize at least two different types of 
10 defense mechanisms: (i) the hypersensitive response ("HR") and (ii) acquired resistance 
( M AR"), including systemic acquired resistance ("SAR") and local acquired resistance 
("LAR"). These defense mechanisms are discussed below. 

The Hypersensitive Response 
Plants respond in a variety of ways to pathogenic microorganisms (Lamb, Cell 
15 76:419-422, 1994; Lamb et al., Cell 56:215-224, 1989). One well-studied defense response 
that occurs at the site of infection is called the hypersensitive response ("HR") and involves 
rapid localized necrosis of the infected plant cells or tissue or both. The rapid death of the 
infected cells is thought to deprive invading pathogens of a sufficient nutrient supply, 
arresting pathogen growth. Cells undergoing a HR exhibit nuclear DNA fragmentation (for 
20 example, DNA laddering), a hallmark of apoptosis first described in animal systems, 

indicating that the HR involves active, programmed cell death (Mittler et al., Plant Physiol. 
108:489-493, 1995; Greenberg et al, Cell 77: 551-563, 1994; Ryerson and Heath, Plant Cell 
8:393-402, 1996; Wang et al., Plant Cell 8, 375-391 , 1996). The HR is also accompanied by 
a membrane-associated oxidative burst that results in the NADPH-dependent production of 
25 0 2 and H 2 0 2 . These reactive oxygen species may be directly toxic to invading pathogens or 
may be involved in the crosslinking of plant cell walls surrounding the lesion to form a 
barrier to infection (Bradley et al., Cell 70:21-30, 1992; Levine et al., Cell 79:583-593, 1994). 

In the 1 950s, H.H. Flor developed a well-known genetic model that explains the 
observation that some races (strains) of a particular pathogen elicited a strong HR on a given 
30 cultivar of a host species, whereas other races (strains) of the same pathogen proliferated and 
caused disease (Flor, Annu. Rev. Phytopathoi 9:275-296, 1971). A pathogen that elicits an 
HR is said to be avirulent on that host, the host is said to be resistant, and the 
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plant-pathogen interaction is said to be incompatible. In contrast, strains which cause 
disease on a particular host are said to be virulent, the host is said to be susceptible, and the 
plant-pathogen interaction is said to be compatible. In many cases, the molecular basis of 
incompatibility appears to be due to a gene-for-gene correspondence between pathogen 
5 "avirulence" (avr) genes and host "resistance" (R) genes (Flor, Annu. Rev. Phytopathol. 

9:275-296, 1 971 ). A plant carrying a particular resistance gene will be resistant to pathogens 
carrying the corresponding avr gene. A simple molecular explanation for this gene-for-gene 
correspondence between avr and R genes is that avr genes generate signals for which 
resistance genes encode the cognate receptors. A signal transduction pathway then carries the 

1 0 a vr-generated signal to a set of target genes which initiates the HR and other host defenses 
(Gabriel and Rolfe, Annu, Rev. Phytopathol 28:365-391, 1990; Keen, Plant Mol Biol 
19:109-122, 1992; Lamb etal., Cell 56:215-224, 1989). 

A variety of avr genes have been cloned from bacterial and fungal phytopathogens 
(Keen, Plant Mol Biol 19:109-122, 1992) and, in at least two cases, gene-for-gene 

1 5 interactions have been demonstrated by experiments showing that a purified a vr-generated 
signal molecule will elicit an HR (Culver and Dawson, Mol Plant-Microbe Interact. 
4:458-463, 1991; Joosten et al., Nature 367:384-386, 1994; Knorr and Dawson, Proc, Natl 
Acad ScL, USA 85:170-174, 1988; van den Ackerveken et al., Plant J. 7:359-366, 1992). 
Several plant resistance genes have also been cloned in the past four years that conform to a 

20 classic gene-for-gene relationship. These include the tomato PTO gene (resistance to strains 
of P. syringae pv tomato expressing the avirulence gene avrPto (Martin et al., Science 
262:1432-1436, 1993)), the Arabidopsis RPS2 m&RPMl genes (resistance to P. syringae 
expressing the avirulence genes avrRptl or avrRpml, respectively (Bent et al., Science 
265:1856-1860, 1994; Grant et aL, Science 269:843-846 1995; Mindrinos et ah, Cell 

25 78:1 089- 1 099, 1 994)), the tobacco N gene (resistance to tobacco mosaic virus (Whitham et 
aL, Cell 78:1 101-1 105, 1994)), the tomato Cf? and CJ2 genes (resistance to the fungal 
pathogen C.fulvum (Dixon et aL, Cell 84:451-459, 1996; Jones et aL, Science 266, 789-794, 
1994)), the flax L h gene (resistance to the fungal pathogen Melampsora lini (Lawrence et aL, 
Plant Cell 7:1 195-1206, 1995)), and the rice Xa21 gene (resistance to Xanthomonas oryzae 

30 (Song et aL, Science 270:1804-1806, 1995)). 
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Acquired Resistancc-Svstemic and Local Acquired Resistance 

The HR not only blocks the local growth of an infecting pathogen, it is also thought to 
trigger additional defense responses in uninfected parts of the plant which become resistant to 
a variety of normally virulent pathogens (Enyedi et al., Cell 70:879-886, 1992; Malamy and 
5 Klessig, Plant J. 2:643-654, 1992). This latter phenomenon is called systemic acquired 
resistance (SAR) and is thought to be the consequence of the concerted activation of many 
genes that are often referred to as pathogenesis-related ("W?") genes. The biological 
functions of many of these PR genes remain unknown; however, a large body of 
physiological, biochemical, and molecular evidence suggests that particular PR genes play a 

1 0 direct role in conferring resistance to pathogens. For example, some PR genes encode 

chitinases and P-l,3-glucanases which directly inhibit pathogen growth in vitro (Mauch et al., 
Plant Physiol. 88:936-942, 1988; Ponstein et al., Plant Physiol. 104:109-1 18, 1994; 
Schlumbaum et al., Nature 324:365-367, 1986; Sela-Buurlage et al., Plant Physiol. 
101:857-863, 1993; Terras et al., J. Biol. Chem. 267:15301-15309, 1992; Woloshuk et al., 

15 Plant Cell 3:619-628, 1991). In addition, constitutive expression in transgenic plants of PR 
genes has been shown to decrease disease susceptibility in a limited number of cases 
(Alexander et al., Proc Natl. Acad, ScL USA 90:7327-7331, 1993; Liu et al., Proc. Natl. 
Acad. Sci. USA 91:1888-1892, 1994; Terras et al., Plant Cell 7:573-588, 1995; Zhu et al., 
Bio/Technology 12:807-812, 1994). 

20 SAR was originally defined by Ross (Virology 14:340-358, 1961), who demonstrated 

that tobacco became resistant to infection by a number of viruses after a primary inoculation 
with an avirulent strain of tobacco mosaic virus. Subsequently, it was demonstrated that 
SAR could also be elicited by other viruses, bacteria, and fungi, and that the resistance 
induced by any particular pathogen was effective against a broad spectrum of viral, bacterial, 

25 and fungal diseases (Cameron et al., Plant J. 5:715-725, 1994; Cruikshank and Mandryk, J. 
Aust. Inst. Agric. Sci. 26:369-372, 1960; Dempsey et al., Phytopathology 83:1021-1029, 
1993; Hecht and Bateman, Phytopathology 54:523-530, 1964; Kuc, Bioscience 39:854-860, 
1982; Lovrekovich et al., Phytopathology 58:1034-1035, 1968; Mauch-Mani and Slusarenko, 
Mol Plant-Microbe Interact, 7:378-383, 1994; Uknes et al., Mol Plant-Microbe Interact, 

30 6:692-698, 1993). 

Another acquired plant defense response that shares many features with SAR is 
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so-called local acquired resistance or "LAR/* LAR develops in the direct vicinity of a 
successfully proliferating pathogen to block further spread of the pathogen and to thwart the 
occurrence of secondary infections. The same set of PR proteins is believed to be involved in 
conferring resistance by both LAR and SAR, and, as described below, the same signalling 
5 molecules also appear to be required for the onset of both responses. 

Certain chemicals, such as salicylic acid (SA), 2,6-dichloroisonicotinic acid (INA), 
and benzo(l,2,3)thiadiazole-7-carbothioic acid S-methyl ester (BTH) have been shown to 
induce SAR or LAR or both when applied exogenously to plants (White, Virology 
99:410-412, 1979; Metraux el Science 250:1004-1006, 1991; Gorlach et aUPlant Cell 

10 8:629-643, 1996). Moreover, several lines of evidence indicate that endogenously produced 
SA is involved in the signal transduction pathway(s) coupling HR with the onset of SAR. In 
tobacco and cucumber, an increase in SA concentration has been observed after an avirulent 
pathogen infection when accompanied by the establishment of SAR (Goodman and Plurad, 
Physiol. Plant Pathol 1 :1 1-16, 1971; Malamy et al., Science 250:1002-1004, 1990; Metraux 

15 et *U Science 250: 1004-1006, 1990; Rasmussen et al., Plant Physiol 97:1342-1347, 1991). 
The accumulation of SA is also associated with the subsequent induction of genes including 
those encoding PR proteins (Van Loon and Van Kammen, Virology 40: 199-21 1, 1970; Ward 
et si.. Plant Cell 3: 1085-1094, 1991; Yalpani et al., Plant Cell 3:809-818, 1991). In tobacco 
and Arabidopsis, exogenously applied SA can induce the accumulation of PR mRNAs, which 

20 is a characteristic of SAR (Uknes et al., Plant Cell 4:645-656, 1 992; Ward et al., Plant Cell 
3:1085-1094, 1991; White, Virology 99:410-412, 1979). 

These results have led to the hypothesis that one of the consequences of pathogen 
infection is the accumulation of SA in vivo, which induces the expression of a set of proteins 
that act to limit further infection of the host (Waixl et al., Plant Cell 3:1085-1094, 1991). 

25 Direct support for this hypothesis has come from the observation that transgenic tobacco or 
Arabidopsis plants that express a bacterial gene encoding a salicylate hydroxylase are unable 
to accumulate SA and, consequently, do not exhibit either SAR or LAR (Gaffhey et al, 
Science 261 :754-756, 1993). Thus, SA is thought to be required in vivo for the establishment 
of SAR and LAR, and, as described above, PR gene products appear to participate directly in 

30 conferring pathogen resistance. 
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Summary of the Invention 
In general, the invention features an isolated nucleic acid molecule including a 
sequence encoding an acquired resistance (AR) polypeptide, wherein the acquired resistance 
polypeptide is at least 40% (and preferably 50%, 70%, 80%, or 90%) identical to the amino 
5 acid sequence of Fig. 5 (SEQ ID NO:3) or Fig. 7B (SEQ ID NO:14). Preferably, such a 
nucleic acid molecule encodes an acquired resistance polypeptide that mediates the 
expression of a pathogenesis-related polypeptide. In another preferred embodiment, the 
acquired resistance polypeptide includes an ankyrin-repeat motif. 

Nucleic acid molecules of the invention are derived from any plant species, including, 
10 without limitation, angiosperms (for example, dicots and monocots) and gymnosperms. 
Exemplary plants from which the nucleic acid may be derived include, without limitation, 
sugar cane, wheat, rice, maize, sugar beet, potato, barley, manioc, sweet potato, soybean, 
sorghum, cassava, banana, grape, oats, tomato, millet, coconut, orange, rye, cabbage, apple, 
watermelon, canola, cotton, carrot, garlic, onion, pepper, strawberry, yam, peanut, onion, 

1 5 bean, pea, mango, and sunflower. Preferred nucleic acid molecules are derived from 
cruciferous plants, for example, Arabidopsis thaliana. Examples of cruciferous acquired 
resistance molecules are shown in Fig. 4 (NPR genomic DNA; SEQ ID NO:l) and Fig. 5 
(NPR cDNA; SEQ ID NO:2). Other preferred nucleic acid molecules are derived from 
solanaceous plants, for example, Nicotiana glutinosa. An example of such a solanaceous 

20 acquired resistance molecule is shown in Fig. 7 A (SEQ. ID NO: 13). 

. . ^ In another aspect, the invention features an isolated nucleic acid molecule (for: 

example, a DNA molecule) that encodes an acquired resistance polypeptide that specifically 
hybridizes to a nucleic acid molecule that includes the nucleic acid sequence of Fig. 4 (NPR 
genomic DNA; SEQ ID NO: 1 ), Fig. 5 (NPR cDNA; SEQ ID NO:2), or Fig. 7A (SEQ ID 

25 NO: 1 3). Preferably, the specifically hybridizing nucleic acid molecule encodes an acquired 
resistance polypeptide that mediates the expression of a pathogenesis-related polypeptide. In 
another preferred embodiment, the specifically hybridizing nucleic acid molecule encodes an 
acquired resistance polypeptide including an ankyrin-repeat motif. In yet other preferred 
embodiments, the specifically hybridizing nucleic acid molecule complements an acquired 

30 resistance mutant (for example, an Arabidopsis npr mutant). The invention also features an 
RNA transcript having a sequence complementary to any of the isolated nucleic acid 
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molecules described above. 

In related aspects, the invention further features a cell or a vector (for example, a plant 
expression vector), each of which includes an isolated nucleic acid molecule of the invention. 
In preferred embodiments, the cell is a bacterium (for example, E. coli or Agrohacterium 
5 tumefaciens) or is a plant cell (for example, is a cell from any of the crops listed above). 
Such a plant cell has an increased level of resistance against a disease caused by a plant 
pathogen (for example, Phytophthora, Peronospora, or Pseudomonas). In yet another 
preferred embodiment, the isolated nucleic acid molecule of the invention is operably linked 
to an expression control region that mediates expression of a polypeptide encoded by the 

1 0 nucleic acid molecule. For example, the expression control region is capable of mediating 
constitutive, inducible (for example, pathogen- or wound-inducible), or cell- or tissue-specific 
gene expression. The invention further features a cell (for example, a bacterium such as E. 
coli ox Agrohacterium tumefaciens, or a plant cell) which contains the vector of the invention. 
In still another aspect, the invention features a transgenic plant including any of the 

15 above nucleic acid molecules of the invention integrated into the genome of the plant, 
wherein the nucleic acid molecule is expressed in the transgenic plant. In addition, the 
invention features seeds and cells from such transgenic plants. For example, such transgenic 
plants may be produced according to conventional methods using any of the above crop 
plants. 

20 In yet another aspect, the invention features a substantially pure acquired resistance 

- s- polypeptide including an amino acid sequence that has at least 40% (andprefcrably; 50%, 
60%, 70%, 80% or 90%) identity to the amino acid sequence of 
Fig. 5 (SEQ ID NO:3) or Fig. 7B (SEQ ID NO:14). Preferably, the acquired resistance 
polypeptide mediates the expression of a pathogenesis-related polypeptide. In other preferred 

25 embodiments, the acquired resistance polypeptide includes an ankyrin-repeat motif or a G- 
protein coupled receptor motif. Such acquired resistance polypeptides are derived from any 
plant species, for example, those crop plants mentioned above. In preferred embodiments, 
the polypeptide of the invention is derived from a cruciferous species, for example, 
Arabidopsis thaliana, or from a solanaceous species, for example, Nicotiana glutinosa. 

30 In a related aspect, the invention also features a method of producing an acquired 

resistance polypeptide. The method involves: (a) providing a cell transformed with a nucleic 
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acid molecule of the invention positioned for expression in the ceil; (b) culturing the 
transformed cell under conditions for expressing the nucleic acid molecule; and (c) 
recovering the acquired resistance polypeptide. The invention further features a 
recombinant acquired resistance polypeptide produced by such expression of an isolated 
5 nucleic acid molecule of the invention, and a substantially pure antibody that specifically 
recognizes and binds to an acquired resistance polypeptide or a portion thereof. 

In another aspect, the invention features a method of providing an increased level of 
resistance against a disease caused by a plant pathogen in a transgenic plant. The method 
involves: (a) producing a transgenic plant cell including the nucleic acid molecule of the 

1 0 invention integrated into the genome of the transgenic plant cell and positioned for 

expression in the plant cell; and (b) growing a transgenic plant from the plant cell wherein the 
nucleic acid molecule is expressed in the transgenic plant and the transgenic plant is thereby 
provided with an increased level of resistance against a disease caused by a plant pathogen. 
In another aspect, the invention features methods of isolating an acquired resistance 

1 5 gene or fragment thereof. The first method involves: (a) contacting the nucleic acid molecule 
of the invention or a portion thereof with a preparation of DNA from a plant cell under 
hybridization conditions providing detection of DNA sequences having 40% or greater 
sequence identity to the nucleic acid sequence of Fig. 4 (SEQ ID NO:l), Fig. 5 (SEQ ID 
NO:2), or Fig. 7A (SEQ ID NO: 1 3); and (b) isolating the hybridizing DNA as an acquired 

20 resistance gene or fragment thereof. The second method involves: (a) providing a sample of 
plant cell DNA; (b) providing a pair of oligonucleotides having sequence homology to a 
region of a nucleic acid molecule of the invention; (c) contacting the pair of oligonucleotides 
with the plant cell DNA under conditions suitable for polymerase chain reaction-mediated 
DNA amplification; and (d) isolating the amplified acquired resistance gene or fragment 

25 thereof. 

In preferred embodiments of the second method, the amplification step is carried out 
using a sample of cDNA prepared from a plant cell. In addition, the pair of oligonucleotides 
used in the second method are based on a sequence encoding an acquired resistance 
polypeptide, wherein the acquired resistance polypeptide is at least 40% (and preferably 50%, 
30 60%, 70%, 80%, or 90%) identical to the amino acid sequence of Fig. 5 (SEQ ID NO:3) or 
Fig. 7B (SEQ ID NO: 14). 
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By "acquired resistance" gene or " AR" gene is meant a gene encoding a polypeptide 
capable of triggering a plant acquired resistance response (for example, a systemic acquired 
resistance (SAR) or local acquired resistance response (LAR)) in a plant cell or plant tissue. 
This response may occur at the transcriptional level or it may be enzymatic or structural in 
5 nature. AR genes may be identified and isolated from any plant species, especially 
agronomically important crop plants, using any of the sequences disclosed herein in 
combination with conventional methods known in the art. 

By "polypeptide" is meant any chain of amino acids, regardless of length or post- 
translational modification (for example, glycosylation or phosphorylation). 
1 o By "pathogenesis-relatcd" polypeptide or "PR" polypeptide is meant a polypeptide 

that is expressed in conjunction with the establishment of SAR or LAR. Exemplary PR 
proteins include, without limitation, chitinase, PR-la, PR1, PR5, GST (glutathione-S- 
transferase), and P-1,3 glucanase, osmotin, thionin, glycine-rich proteins (GRPs), 
phenylalanine ammonia lyase (PAL), and lipoxygenase (LOX). 
1 5 By "ankyrin-repeat" motif is meant a consensus motif that is found in a wide variety 

of proteins that arc capable of mediating protein-protein interactions. Ankyrin-repeat motifs 
are described in Michaely and Bennett (Trends in Cell Biology 2:127-129, 1 992) and Bork 
(Proteins: Structure. Function, and Genetics 17:363-374, 1993). 

By "substantially identical" is meant a polypeptide or nucleic acid exhibiting at least 
20 40%, preferably 50%, more preferably 80%, and most preferably 90%, or even 95% 

homology to a reference amino acid sequence (for example, the amino acid sequence shown, 
in Fig. 5 (SEQ ID NO:3) or Fig. 7B (SEQ ID NO: 14)) or nucleic acid sequence (for example, 
the nucleic acid sequences shown in Fig. 4, or Fig. 5, or Fig. 7A, SEQ ID NOS:l, 2, and 13, 
respectively). For polypeptides, the length of comparison sequences will generally be at least 
25 1 6 amino acids, preferably at least 20 amino acids, more preferably at least 25 amino acids, 
and most preferably 35 amino acids. For nucleic acids, the length of comparison sequences 
will generally be at least 50 nucleotides, preferably at least 60 nucleotides, more preferably at 
least 75 nucleotides, and most preferably 1 10 nucleotides. 

Sequence identity is typically measured using sequence analysis software (for 
30 example, Sequence Analysis Software Package of the Genetics Computer Group, University 
of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, WI 53705, BLAST, 
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or PILEUP/PRETTYBOX programs). Such software matches identical or similar sequences 
by assigning degrees of homology to various substitutions, deletions, and/or other 
modifications. Conservative substitutions typically include substitutions within the following 
groups: glycine alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid, asparagine, 
5 glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine. 

By a "substantially pure polypeptide" is meant an AR polypeptide (for example, an 
NPR polypeptide such as NPR1) that has been separated from components which naturally 
accompany it. Typically, the polypeptide is substantially pure when it is at least 60%, by 
weight, free from the proteins and naturally-occurring organic molecules with which it is 

1 0 naturally associated. Preferably, the preparation is at least 75%, more preferably at least 
90%, and most preferably at least 99%, by weight, an AR polypeptide. A substantially pure 
AR polypeptide may be obtained, for example, by extraction from a natural source (for 
example, a plant cell); by expression of a recombinant nucleic acid encoding an AR 
polypeptide; or by chemically synthesizing the protein. Purity can be measured by any 

1 5 appropriate method, for example, column chromatography, polyacrylamide gel 
electrophoresis, or by HPLC analysis. 

By "derived from" is meant isolated from or having the sequence of a naturally- 
occurring sequence (e.g., a cDNA, genomic DNA, synthetic, or combination thereof). 

By "isolated DNA" is meant DNA that is free of the genes which, in the naturally- 

20 occurring genome of the organism from which the DNA of the invention is derived, flank the 
gene. The term therefore includes, for example, a recombinant DNA that is incorporated jnto 
a vector; into an autonomously replicating plasmid or virus; or into the genomic DNA of a 
prokaryote or eukaryote; or that exists as a separate molecule (for example, a cDNA or a 
genomic or cDNA fragment produced by PGR or restriction endonuclease digestion) 

25 independent of other sequences. It also includes a recombinant DNA which is part of a 
hybrid gene encoding additional polypeptide sequence. 

By "specifically hybridizes" is meant that a nucleic acid sequence is capable of 
hybridizing to a DNA sequence at least under low stringency conditions as described herein, 
and preferably under high stringency conditions, also as described herein. 

30 By "transformed cell" is meant a cell into which (or into an ancestor of which) has 

been introduced, by means of recombinant DNA techniques, a DNA molecule encoding (as 
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used herein) an AR polypeptide. 

By "positioned for expression" is meant that the DNA molecule is positioned adjacent 
to a DNA sequence which directs transcription and translation of the sequence (i.e., facilitates 
the production of, for example, an AR polypeptide, a recombinant protein, or an RNA 
5 molecule). 

By "reporter gene" is meant a gene whose expression may be assayed; such genes 
include, without limitation, P-glucuronidase (GUS), luciferase, chloramphenicol 
transacetylase (CAT), green fluorescent protein (GFP), B-galactosidase, herbicide resistant 
genes and antibiotic resistance genes. 

10 By "expression control region" is meant any minimal sequence sufficient to direct 

transcription. Included in the invention are promoter elements that are sufficient to render 
promoter-dependent gene expression controllable for cell-, tissue-, or organ-specific gene 
expression, or elements that are inducible by external signals or agents (for example, light-, 
pathogen-, wound-, stress-, or hormone-inducible elements or chemical inducers such as SA 

15 or INA); such elements may be located in the 5 ! or 3' regions of the native gene or engineered 
into a transgene construct. 

By "operably linked" is meant that a gene and a regulatory sequence(s) are connected 
in such a way as to permit gene expression when the appropriate molecules (for example, 
transcriptional activator proteins) are bound to the regulatory sequence(s). 

20 By "plant cell" is meant any self-propagating cell bounded by a semi-permeable 

membrane and containing a plastid. Such a cell also requires a cell wall if further 
propagation is desired. Plant cell, as used herein includes, without limitation, algae, 
cyanobacteria, seeds, suspension cultures, embryos, meristematic regions, callus tissue, 
leaves, roots, shoots, gametophytes, sporophytes, pollen, and microspores. 

25 By "crucifer" is meant any plant that is classified within the Cruciferae family. The 

Cruciferae include many agricultural crops, including, without limitation, rape (for example, 
Brassica campestris and Brassica napus), broccoli, cabbage, brussel sprouts, radish, kale, 
Chinese kale, kohlrabi, cauliflower, turnip, rutabaga, mustard, horseradish, and Arabidopsis. 
By "transgene" is meant any piece of DNA which is inserted by artifice into a cell, 

30 and becomes part of the genome of the organism which develops from that cell. Such a 
transgene may include a gene which is partly or entirely heterologous (i.e., foreign) to the 
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transgenic organism, or may represent a gene homologous to an endogenous gene of the 
organism. 

By "transgenic" is meant any cell which includes a DNA sequence which is inserted 

by artifice into a cell and becomes part of the genome of the organism which develops from 
5 that cell. As used herein, the transgenic organisms are generally transgenic plants and the 

DNA (transgene) is inserted by artifice into the nuclear or plastidic genome. A transgenic 

plant according to the invention may contain one or more acquired resistance genes. 

By "pathogen" is meant an organism whose infection of viable plant tissue elicits a 

disease response in the plant tissue. Such pathogens include, without limitation, bacteria, 
10 mycoplasmas, fungi, insects, nematodes, viruses, and viroids. Plant diseases caused by these 

pathogens are described in Chapters 11-16 of Agrios, Plant Pathology, 3rd ed., Academic 

Press, Inc., New York, 1988. 

Examples of bacterial pathogens include, without limitation, Erwinia (for example, E. 

carotovoraX Pseudomonas (for example, P. syringae), and Xanthomonas (for example, X. 
1 5 campepestris and X. oryzae). 

Examples of fungal disease-causing pathogens include, without limitation, Alternaria 

(for example, A. brassicola and A.solani), Ascochyta (for example, A. pisi), Botrytis (for 

example, B. cinerea), Cercospora (for example, C. hkuchii and C zaea-maydis\ 

Colletotrichum sp. (for example, C. lindemuthianum), Diplodia (for example, ZX maydis), 
20 Erysiphe (for example, E. graminis f.sp. graminis and E. graminis f.sp. hordei), Fusarium 

(for example, F. nivale and P. oxysporum.F. graminearum, p. solani, F. monilforme, and F. ; 

roseum\ Gaeumanomyces (for example, G graminis f.sp. tritici), Helminthosporium (for 

example, H. turcicum, H. carbonum y and H. maydis), Macrophomina (for example, M. 

phaseolina and Maganaporthe grisea), Nectria (for example, N. heamatocacca), 
25 Peronospora (for example, P. manshurica y P. tabacina), Phoma (for example, P. betae\ 

Phymatotrichum (for example, P omnivorum), Phytophthora (for example, P cinnamomi, P. 

cactorum, P. phaseoli, P. parasitica, P. citrophthora, P. megasperma f.sp. sojae, and P. 

infestans), Plasmopara (for example, P. viticola), Podosphaera (for example, P. leucotricha), 

Puccinia (for example, P. sorghi $ P. striiformis, P graminis fsp. tritici, P. asparagi, P. 
30 recondita y and P. arachidis), Puthium (for example, P. aphanidermatum), Pyrenophora (for 

example, P. tritici-repentens\ Pyricularia (for example, P. oryzea\ Pythium (for example, P. 
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ultimum), Rhizoctonia (for example, R. solani and R. cerealis), Scerotium (for example, S. 
rolfsii), Sclerotinia (for example, S. sclerotiorum), Septoria (for example, S. lycopersici y S. 
glycines, S. nodorum and S. tritici), Thielaviopsis (for example, T. hasicola), Uncinula (for 
example, U. necator), Venturia (for example, V. inaequalis), Verticillium (for example, V. 
5 dahliae and V. albo-atrum). 

Examples of pathogenic nematodes include, without limitation, root-knot nematodes 
(for example, Meloidogyne sp. such as M incognita, M arenaria, M chitwoodi, M. hapla, M 
javanica, M graminocola, M. microtyla, M. graminis, and M. naasi), cyst nematodes (for 
example, Heterodera sp. such as //. schachtii, H. glycines* H. saccharic H. oryzae, H. avenae, 

1 0 H. cajani, H. elachista, H. goettingiana, K graminis, H, mediterranean H. mothi, 11. sorghi, 
and H. zeae, or, for example, Globodera sp. such as C. rostochiensis and G. pallida), root- 
attacking nematodes (for example, Rotylenchulus reniformis, Tylenchuylus semipenetrans, 
Pratylenchus brachyurus, Radopholus citrophilus, Radopholus similis y Xiphinema 
americanum, Xiphinema rivesi 9 Paratrichodorus minor, Heterorhabditis heliothidis, and 

1 5 Bursaphelenchus xylophilns) 7 and above-ground hematodes (for example, Anguina funesta, 
Anguina tritici, Ditylenchus dipsaci, Ditylenchus myceliphagus, and Aphenlenchoides 
besseyi). 

Examples of viral pathogens include, without limitation, tobacco mosaic virus, 
tobacco necrosis virus, potato leaf roll virus, potato virus X, potato virus Y, tomato spotted 

20 wilt virus, and tomato ring spot virus. 

By "increased level of resistance" is meant a greater level of resistance to a disease- 
causing pathogen in a transgenic plant (or cell or seed thereof) of the invention than the level 
of resistance relative to a control plant (for example, a non-transgenic plant). In preferred 
embodiments, the level of resistance in a transgenic plant of the invention is at least 20% (and 

25 preferably 30% or 40%) greater than the resistance of a control plant. In other preferred 
embodiments, the level of resistance to a disease-causing pathogen is 50% greater, 60% 
greater, and more preferably even 75% or 90% greater than a control plant; with up to 100% 
above the level of resistance as compared to a control plant being most preferred. The level 
of resistance is measured using conventional methods. For example, the level of resistance to 

30 a pathogen may be determined by comparing physical features and characteristics (for 

example, plant height and weight, or by comparing disease symptoms, for example, delayed 
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lesion development, reduced lesion size, leaf wilting and curling, water-soaked spots, and 
discoloration of cells) of transgenic plants. 

By "detectably-labelled" is meant any direct or indirect means for marking and 
identifying the presence of a molecule, for example, an oligonucleotide probe or primer, a 
5 gene or fragment thereof, or a cDNA molecule or a fragment thereof. Methods for 

detectably-labelling a molecule are well known in the art and include, without limitation, 
radioactive labelling (for example, with an isotope such as 32 P or 35 S) and nonradioactive 
labelling (for example, chemiluminescent labelling, for example, fluorescein labelling). 

By "purified antibody" is meant antibody which is at least 60%, by weight, free from 
10 proteins and naturally-occurring organic molecules with which it is naturally associated. 

Preferably, the preparation is at least 75%, more preferably 90%, and most preferably at least 
99%, by weight, antibody, for example, an acquired resistance polypeptide-specific antibody. 
A purified AR antibody may be obtained, for example, by affinity chromatography using a 
recombinantly-produced acquired resistance polypeptide and standard techniques. 
1 5 By "specifically binds" is meant an antibody which recognizes and binds an AR 

protein but which does not substantially recognize and bind other molecules in a sample, for 
example, a biological sample, which naturally includes an AR protein such as NPR. 

As discussed above, fundamental acquired resistance genes that are responsible for 

20 providing plants with the ability to protect themselves against pathogens have been identified. 

L Accordingly, the invention provides a number of important advances and. ad vantages for the 
protection of plants against their pathogens. For example, by providing AR genes as 
described herein that are readily incorporated and expressed in all species of plants, the 
invention facilitates an effective and economical means for in-plant protection against plant 

25 pathogens. Such protection against pathogens reduces or minimizes the need for traditional 
chemical practices (for example, application of fungicides, bactericides, nematicides, 
insecticides, or viricides) that are typically used by farmers for controlling the spread of plant 
pathogens and providing protection against disease-causing pathogens. In addition, because 
plants expressing one or more acquired resistance gene(s) described herein are less vulnerable 

30 to pathogens and their diseases, the invention further provides for increased production 

efficiency, as well as for improvements in quality and yield of crop plants and ornamentals. 
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Thus, the invention contributes to the production of high quality and high yield agricultural 
products: for example, fruits, ornamentals, vegetables, cereals and field crops having reduced 
spots, blemishes, and blotches that are caused by pathogens; agricultural products with 
increased shelf-life and reduced handling costs; and high quality and yield crops for 
5 agricultural (for example, cereal and field crops), industrial (for example, oilseeds), and 

commercial (for example, fiber crops) purposes. Furthermore, because the invention reduces 
the necessity for chemical protection against plant pathogens, the invention benefits the 
environment where the crops are grown. Genetically-improved seeds and other plant 
products that are produced using plants expressing the genes described herein also render 

10 fanning possible in areas previously unsuitable for agricultural production. The invention 
further provides a means for mediating the expression of pathogenesis-related proteins, for 
example, chitinase and GST, that confer resistance to plant pathogens. For example, 
transgenic plants constitutively producing an AR gene product are capable of activating PR 
gene expression, which in turn confers resistance to plant pathogens. Collective PR gene 

1 5 expression that is mediated by the AR gene product obviates the need to express individual 
PR genes as a means to promote plant defense mechanisms. 

The invention is also useful for providing nucleic acid and amino acid sequences of an 
AR gene that facilitates the isolation and identification of AR genes from any plant species. 
Other features and advantages of the invention will be apparent from the following 

20 description of the preferred embodiments thereof, and from the claims. 

Detailed Description 
The drawings will first be described. 
Drawings 

25 Fig. 1 is a schematic illustration showing the physical map of A. thaliana 

chromosome I and the position of NPRL 

Fig. 2A is a photograph of a Northern blot analysis showing the expression of the PR- 

1 gene in wild type plants(Col-0, lanes 1-3), nprl-2 mutant plants(lanes 4-6), nprl-2 

transform ants with a noncomplementing cosmid (m305-2-7, lanes 7-9), and nprl-2 
30 transformants with complementing cosmids (2 1 A4-P5- 1 , lanes 10-12 and 2 1 A4-6- 1 - 1 , lanes 

13-15). RNA samples were prepared from fifteen-day old seedlings grown on MS media 
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(lanes 1, 4, 7, 10, and 13), MS media with 0.1 mM INA (lanes 2, 5, 8, 1 1, and 14), and MS 
media with 0.1 mM SA (lanes 3, 6, 9, 12, and 15). 

Fig. 2B is a series of photographs showing disease symptoms (top panels) and BGL2- 
GUS expression (bottom panels) induced by Psm ES4326 on wild-type (left panels), nprl-1 
5 (middle panels), and an nprl-1 transformant with a complementing cosmid (21 A4-4-3-1, 
right panels). 

Fig. 2C is a panel of graphs showing the growth of Psm ES4326 in wild-type, nprl-2, 
and an nprl-2 transformant with a complementing cosmid (21 A4-P5-1). Error bars represent 
95% confidence limits of log-transformed data as described by Sokal and Rohlf (Biometry, 2d 
1 0 ed., W.H. Freeman and Company, New York, 1 981 ). 

Fig. 2D is a panel of bar graphs showing the disease rating of P. parasitica NOCO 
infection in wild type, nprl-2, and an nprl-2 transformant with a complementing cosmid 
(21 A4-P5-1). The disease rating scales are defined as follows: 0, no conidiophores on the 
plant; 1, no more than 5 conidiophores per infected leaf; 2, 3-20 conidiophores on a few 
1 5 infected leaves; 3, 6-20 condiophores on most infected leaves; 4, 5 or more conidiophores on 
all infected leaves; 5, 20 or more conidiophores on all infected leaves. 

Fig. 3 is a schematic illustration showing the restriction map of the 7.5-kb region 
containing the NPR1 gene. 

Fig. 4 is a schematic illustration showing the genomic sequence of the 7.5-kb region 
20 containing the acquired resistance nucleic acid sequence of the gene termed NPR1 (SEQ ID 
NO:l ) from Arabidopsis thaliana. 

Fig. 5 is a schematic illustration showing the cDNA sequence (SEQ ID NO:2) and 
deduced amino acid sequence (SEQ ID NO:3) of the acquired resistance protein termed 
NPR1 from Arabidopsis thaliana. Amino acids numbered 262-289, 323-371, and 453-469 
25 show homology to a mouse ankyrin protein, an ankyrin -repeat motif, and a G-protein coupled 
receptor moti f, respectively. 

Fig. 6A is a schematic illustration showing the alignment of the NPR1 amino acid 
sequence with mouse ankyrin 3 (ANKB). Two regions producing the highest scoring pairs 
(smallest sum probability = 0.0004) generated using a BLAST search are shown. The 
30 identical and similar amino acids (+) are highlighted in bold, circled letters. 

Fig. 6B is a schematic illustration showing the alignment of the ankyrin repeats in 
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NPRI with the ankyrin repeat consensus derived from Michaely and Bennett (Trends in Cell 
Biology 2:127-129, 1992) and Bork (Proteins: Structure, Function, and Genetics 17:363-374, 
1 993). Since there are a few non-overlapping amino acids between the two derived 
consensus sequences, both are presented. In the consensus derived from Bork, the conserved 
5 features are indicated: t, turn-like or polar; o, S/T; h, hydrophobic; capitals, conserved amino 
acids. Those amino acids identical to the consensus are highlighted in bold, circled letters. 

Fig. 7A is a schematic illustration showing the cDNA sequence 
(SEQ ID NO: 13) of an NPR1 homolog isolated from Nicotiana glutinosa. 

Fig. 7B is a schematic illustration showing the deduced amino acid sequence of the 
1 0 NPR 1 homolog of Nicotiana glutinosa (SEQ ID NO: 1 4) shown in Fig. 7 A. 

Fig. 8 A is a graph illustrating the dosage effect of NPR 1 on the resistance of 
transgenic Arabidopsis to the bacterial pathogen, Psm ES4326. Eight samples were taken at 
each time point for the Psm ES4326 infection (initial inoculant OD^^.OOl). Error bars 
represent 95% confidence limits of log-transformed data. Colony forming unit is designated 
1 5 as cfu. 

Fig. 8B is a histogram showing the dosage effect of NPR1 on the resistance of 
transgenic Arabidopsis to the fungal pathogen, Peronspora parasitica NOC02. A spore 
suspension (3x10* spores/mL) of P. parasitica was used for these infection studies, and the 
number of conidiophores on each plant was counted seven days after infection. The data 
20 were analyzed using Wilcoxon two-sample tests. At the 95% confidence level, significant 
difference in growth was present between all pairs of samples except ColNPRl-M and 
ColNPRl-H, and Col and ColNPRl-L. 

Fig. 9A are photographs showing the restoration of inducible BGL2-GUS expression 
in 35S-NPRI-GFP transgenic plants. Seedlings were grown on either MS or MS-1NA (0. 1 
25 mM) media for fourteen days and stained for GUS activity. 

Fig. 9B is a photograph showing the complementation of the SA sensitivity in the 
Arabidopsis nprl mutant by 35S-NPR1-GFP. Seedlings were grown for eleven days on MS- 
SA (0.5 mM) medium. The NPR1-GFP transgene restored normal growth to nprl on SA. 
The mGFP transgene, however, was unable to restore normal growth to nprl. Note that the 
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NPR1-GFP line used was in the T 2 generation. The observed 3:1 segregation ratio indicated 
that the transgenic plants contained a single locus NPRl-GFP insertion. 

Fig. 9C is a histogram showing the restoration of P. parasitica resistance to the T 2 
NPRl-GFP transformants. IN A treatment (0.65 mM) was carried out seventy-two hours 
5 prior to infection with a spore suspension (3xl0 4 spores/mL). The disease symptoms were 
scored seven days after the infection with respect to the number of conidiophores on the 
plant. The disease rating scale is defined as: 0, no conidiophores on the plant; 1 , no more than 
5 conidiophores per infected leaf; 2, 6-20 conidiophores on a few infected leaves; 3, 6-20 
conidiophores on most of the infected leaves; 4, 5 or more conidiophores on all infected 

10 leaves; 5, 20 or more conidiophores on all infected leaves. Seedlings in the 0, 4, and 5 

categories were also examined for the presence of the NPR1-GFP transgene, and the number 
of NPRl-GFP transformants is indicated in the parenthesis. Most of the P. parasitica 
resistant plants (0 category) contained the NPRl-GFP transgene; however, all of the sensitive 
plants (4 and 5 categories) were observed to segregate as non-transformants lacking the 

1 5 transgene. 

Fig. 10 is a photograph showing the localization of NPRl-GFP in response to 
chemical activators of SAR. The transformants, containing either the NPRl-GFP (top and 
bottom panels) or mGFP transgene (middle panels) were grown for eleven days on MS or 
MS-INA media. GFP fluorescence was visualized by confocal microscopy in leaf mesophyll 
20 cells and guard cells. DIC is shown in the red channel and GFP is shown in the green 
channel. 

Figs. 1 1 A- 1 1G are a series of photographs showing the localization of NPRl-GFP in 
response to Psm ES4326 infection. Leaves of NPRl-GFP transformants were infiltrated on 
the left half with either Psm ES4326 (Fig. i IB) or 1 0 mM MgCl 2 (Fig. 1 IE) and stained for 
25 BGL2-GUS expression after three days. Prior to GUS staining the leaves were analyzed for 
GFP localization on the infiltrated (Fig. 1 1 A and Fig. 1 ID) and the uninfiltrated (Fig. 1 1C) 
side. Leaves of mGFP transformants were infiltrated with Psm ES4326 (Fig. 1 IF) or 10 mM 
MgCl 2 (Fig. 1 1 G) and analyzed for GFP localization. 
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Overview 

^ A genetic study was conducted using Arabidopsis thaliana as a model system to 

identify key elements that control the signaling pathway leading to the induction of acquired 
resistance (AR) t for example, a system acquired resistance (SAR) response, to pathogen 
5 infection in plants. In wild-type Arabidopsis plants, SAR responses can be induced by 

treatment with 0.1 mM salicylic acid (SA) or 0.1 mM 2,6-dichloroisonicotinic acid (IN A) or 
after an infection by an avirulent pathogen such as Pseudomonas syringae pv phaseolicola 
NP3121 lavrRpH (P.s. phaseolicola 3\2\lawRpt2), SAR is demonstrated by enhanced 
resistance to virulent pathogens, such as Pseudomonas syringae pv maculicola ES4326 (P.s. 

1 0 maculicola ES4326), and by increased expression of patho genesis-related genes (for 
example, PR genes including PR1, BGL2, and PR5). To facilitate detection of PR gene 
expression and identification of mutants that were aberrant in the SAR signaling pathway, a 
BGL2-GUS reporter gene was constructed and transformed into Arabidopsis thaliana ecotype 
Columbia. This parental line containing the BGL2-GUS transgene was mutagenized by 

1 5 treatment of seeds with 0.3% ethyl methanesulfonate for eleven hours. The M2 progeny of 
the mutagenized population were screened for the lack of BGL2-GUS expression in the 
presence of the SAR-inducers SA and INA (Cao et al, Plant Cell 6:1583-1592, 1994). 

Using these techniques, the nprl-1 (nonexpresser of £R genes) mutant was isolated 
and found to have almost complete lack of expression of the BGL2-GUS reporter gene, as 

20 well as a lack of expression of the endogenous PR J, BGL2, and PR5 genes in response to SA, 
INA, and avirulent pathogen treatments (Cao et al., Plant Cell 6:1583-1592, 1994). Further 
characterization of the nprl-1 mutant showed that mutations in the NPR1 gene completely 
blocked the induction of SAR. In the nprl-1 plants pretreated with SA, INA, or an avirulent 
pathogen, growth of virulent pathogens (for example, P.s. maculicola ES4326) was not 

25 inhibited, as found in the parental line carrying the wild-type NPR1 gene. This finding 
demonstrated that the NPR1 gene plays a key role in the signaling pathway leading to the 
establishment of SAR. 

Two additional nprl mutants, nprl-2 and nprl-3, were isolated on the basis that they 
were more susceptible to infection than wild-type plants by P.s. maculicola strain ES4326 
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(Glazebrook et aL, Genetics 143:973-982, 1996). Genetic complementation tests showed that 
nprl-l, nprl-2, and nprl-3 were allelic. 

The NPR1 gene not only controls the onset of systemic resistance, but also was found 
to affect local acquired resistance ("LAR"), the ability of plants to restrict the spread of 
5 virulent pathogen infections. In nprl mutant plants, the virulent pathogen P.s. maculicola 
ES4326 grows to a greater extent and spreads further beyond the initial site of invasion than 
in the wild-type plants. The effects of the impaired SAR and LAR in nprl mutants is also 
evident when various strains of Peronospora parasitica were tested. Disease symptoms (i.e., 
downy mildew) were observed after infection by strains of P. parasitica to which the 

1 0 wild-type parental line of Arabidopsis is resistant, showing the break down of the "natural" 
resistance in the nprl mutants. The effects of the nprl mutations appeared to be specific to 
the defense response. No significant morphological phenotypes were observed in three allelic 
nprl mutants, nprl-1, nprl -2, nprl-3. However, when grown on medium containing a high 
concentration of SA (0.5 mM), the growth of all three nprl mutants was arrested at the 

1 5 cotyledon stage, and the seedlings were bleached. Wild-type plants were observed to grow 
normally in the presence of 0.5 mM SA. 

The phenotypes of the nprl mutants clearly demonstrated the biological significance 
of the NPR1 gene of Arabidopsis thaliana in controlling the defense response against a broad 
spectrum of pathogens. 

20 The NPR1 gene was cloned using a map-based positional cloning strategy. The 

location of NPR1 on the Arabidopsis genome was first delimited to a 7.5-kilobase (kb) region 
contained on cosmid clones 21 A4-4-3-1, 21A4-6-1-1, 21A4-P5-1, 21A4-P4-1, and 21 A4-2-1 
by its ability to complement the nprl mutant. An SA-inducible 2.0-kb RNA transcript 
encoded within this 7.5-kb region corresponding to NPR1 was identified by RNA blot 

25 analysis. Isolation of this acquired resistance gene facilitates the cloning of AR genes from 
plants of agricultural or economic importance. For example, engineering ectopic expression 
of AR genes (for example, an NPR gene) in crop plants, which is useful for providing novel 
strategies for creating plants with enhanced resistance to pathogen infection. 

There now follows a description of the cloning of an Arabidopsis AR gene, NPRL A 
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dcscription is also provided of the cloning of the NPRl homolog from Nicotiana glutinosa. 
These examples are provided for the purpose of illustrating the invention, and should not be 
construed as limiting. 

Genetic Analysis of SAR in Arabidopsis and the Isolation of nnrl Mutants 
5 Using Arabidopsis thaliana, components of the signalling pathway in SAR 

downstream of SA and INA induction have been identified. Specifically, we sought 
Arabidopsis mutants that did not express PR genes in the presence of added SA or INA. 
Because there is no visible phenotype known to be associated with such mutants, transgenic 
Arabidopsis plants were generated which expressed P-glucuronidase (GUS) under the control 

10 of the Arabidopsis p-l,3-glucanase (BGL2) promoter (Dong et al., Plant Cell 3:61-72, 1991 ). 
The BGL2 gene is one of the PR genes regulated by SA (Uknes et al., Plant Cell 4:645-656, 
1992). Briefly, seed from the transgenic line (BGL2-GUS) were mutagenized with ethyl 
methanesulfonate (EMS), and the resulting mutants were screened after SA or INA treatment 
for aberrant expression of GUS. The results of these screenings showed that high levels of 

1 5 P-glucuronidase (GUS) activity could be assayed in a single well of a ninety-six well 

microtiter plate using a single leaf from a plant that had been grown for two weeks on plates 
containing SA or INA. Screens were performed for Arabidopsis mutants that either 
expressed the BGL2-GUS reporter constitutively in the absence of S A or INA treatment or 
that failed to express the reporter gene following treatment with SA or INA. These screens 
,20 led to the identification of a series of mutants called cpr and npr (constitutive expresser of PR 
genes and for non-expresser of PR genes, respectively) which define genes that are involved 
both in the regulation of BGL2 specifically and SAR in general (Bowling et al., Plant Cell 
6:1845-1857, 1994; Cao et al., Plant Cell 6:1583-1592, 1994). 
Construct ion of BGL2-GUS Transgenic Arabidopsis 

25 An Xbal-SphI fragment (2025 base pairs (bp)) containing 1 746-bp of noncoding 

sequence upstream of the start codon of the Arabidopsis BGL2 gene was fused at the ATG 
site to the coding region of the Escherichia coli uidA gene (referred to as the GUS gene) and 
transferred into the vector pBHOl, which was then used to transform Arabidopsis ecotype 
Columbia (Valvekens et al., Proc. Natl Acad. ScL USA 85:5536-5540, 1988). Plants 
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homozygous for the BGL2-GUS construct were identified on the basis that progeny of these 
plants were resistant to kanamycin and the presence of the transgene that was detected using 
Southern hybridization. 

Mutagenesis of the BGL2-GUS Transgenic Line 
5 Mutagenesis was performed in the BGL2-GUS/BGL2-GUS transgenic line by 

exposing -36,000 seeds to 0.3% ethyl methanesulfonate for eleven hours. Seeds were sown, 
and the plants were allowed to self-fertilize to produce M 2 seeds, which were collected in 
twelve independent pools. 

Identification of the nprl-1 Mutant 

1 0 The M 2 seeds were germinated on MS medium with the addition of 0.8% agar, 0.5 

mg/mL Mes (2-(A r -morpholino)ethane-sulfonic acid), pH 5.7, 2% sucrose, 50 /ug/tnL 
kanamycin, and 100 ,ug/mL ampicillin. Either 0.5 mM salicylic acid (SA) or 0.1 mM IN A 
was added to induce systemic acquired resistance (SAR). After incubation for fifteen days, 
each seedling to be assayed was numbered, and a single leaf was then removed from each 

1 5 seedling and put into the corresponding sample well of a ninety-six- well microtiter plate that 
contained 100 of P-glucuronidase (GUS) substrate solution (50 mM Na 2 HP0 4 , pH 7.0, 10 
mM Na 2 EDTA, 0.1% Triton X-100, 0.1% sarkosyl, 0.7 jAJmL pmercaptoethanol, and 0.7 
mg/mL 4-methylumbelliferyl P-D-glucuronide). After all the samples were collected, the 
microtiter plate was placed under vacuum for two minutes to infiltrate the samples and then 

20 incubated at 37 °C overnight. Samples were examined for the fluorescent product of GUS 
activity (4-methylumbellifone) using a long-wavelength UV light. Those seedlings which 
showed no GUS activity were identified on the MS plate and transplanted to soil for seed 
setting. This procedure was repeated in the progeny of these putative mutants to ensure that 
the mutant phenotype was heritable and to identify the homozygous mutants. Of 13,468 M 2 

25 plants tested, 181 did not exhibit GUS activity in the presence of either SA or IN A. In the M 3 
generation, 77 of 139 lines tested maintained a mutant phenotype for GUS activity, with 76 
nonresponsive to both SA and INA and one line nonresponsive to SA but responsive to INA. 

Three classes of mutations were predicted to be carried by the mutants that were 
nonresponsive to SA or INA treatment: (I) mutations in regulatory genes which not only 
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afFect expression of the transgene, but also the endogenous PR genes; (2) mutations in the 
promoter of the transgene which affect the responsiveness of BGL2-GUS, but not that of the 
endogenous PR genes to SA and INA; and (3) mutations in the coding region of the GUS 
gene which abolish the enzymatic activity of GUS, but not the transcription of GUS mRNA. 
5 To distinguish between these classes, the expression of endogenous PR genes was analyzed 
in the M 3 generation. Regulatory gene mutants should be readily distinguished in the M 3 
generation by an aberrant level of expression of other SAR-related PR genes. 

RNA gel blot analysis was performed with these 77 mutant lines to identify those 
with modified expression of PR genes. The expression of the Arahidopsis mitochondrial 

1 0 p-ATPase gene served as a control for sample loading. Among the 77 mutant lines, six were 
found to have reduced expression of the endogenous PR genes to some degree (class I); three 
showed aberrant expression only in BGL2-GUS (class 2); and fourteen were found to have 
reduced GUS activity but normal transcription of BGL2-GUS (class 3). One class I mutant 
(nprl-1) exhibited a dramatic reduction in expression of the GUS y BGL2, and PR-1 genes 

1 5 compared to the wild-type in the presence of SA or INA. Therefore, nprl-1 was selected for 
further study. 

The nprl-1 mutant was tested for the induction of PR-5 y another PR gene that has 
been cloned in Arahidopsis (Uknes et al., Plant Cell 4:645-656, 1992), and a similar 
reduction in expression was observed. The reduction in PR gene expression after S A or INA 

20 treatment was quantified for nprl-1 relative to the parent BGL2-GUS line (representing the 
wild-type). In nprl-1, the expression of both GUS and BGL2 was ten-fold lower than that of 
the wild-type and that of PR-5 was five-fold lower. The most dramatic reduction was 
observed for PR-1 which was twenty-fold lower than the wild-type. 
Quantitative GUS Assa ys Using nprl-1 

25 To measure accurately the level of GUS activity, a quantitative GUS assay was 

performed on nprl-1 plants and the wild-type BGL2-GUS plants grown in the presence of 
either SA or INA, or in the absence of both. In the absence of an inducer, the background 
level of GUS activity was five-fold lower in the nprl-1 mutant than in the wild-type. 
Wild-type plants grown in the presence of 0.5 mM SA showed a fifty-two-fold increase in 
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GUS activity compared to the uninduced plants, whereas in the SA-induced nprl-1 plants, 
the increase in GUS activity was only seven-fold. Moreover, the induction by 0. 1 mM INA 
was forty-eight-fold for the wild-type versus five-fold for nprl-L Thus, while GUS activity 
in the SA- or INA-treated nprl-1 plants was somewhat induced, the activity was at most only 
5 slightly higher than the background level of the untreated wild-type. 
Genetic Analysis of the nprl-1 Locus 

A backcross of nprl-l/nprl-1 with its wild-type parent (NPRl/NPRl in the 
BGL2-GUS background) resulted in F, progeny {NPRl/nprl-1, sixteen plants were tested) 
with the same pattern of GUS staining (using 5-bromo-4-chloro- 

1 0 3-indolyl glucuronide [XGluc] as the substrate) observed in the wild-type after SA or INA 
treatment. GUS staining was not detected in the SA- or INA-treated nprl-l/nprl-1 
homozygous plants even after two days of incubation at 28°C. Self-fertilization of the F, 
plants produced F 2 progeny that segregated for GUS activity, intense staining or complete 
absence of staining, which were present with a ratio of 219:64 among the 283 F 2 plants 

1 5 examined, demonstrating that the mutant phenotype is recessive and due to a single nuclear 
mutation (x 2 =0.86; P>0.1). 

S A-. INA-. and Avirulent Pathogen-Induced Protection Against Pseudomongs 
syringae pv maculicola ES4326 Infection in Wild-Type and nprl-1 

To examine whether the lack of SA- or INA-induced PR gene expression would affect 

20 SAR protection against a virulent pathogen infection, fifteen-day-old wild-type and nprl-lj 
plants were treated with either 1 mM SA or 0.65 mM INA, and two days later were exposed 
to a P.s. maculicola ES4326 bacterial suspension. Significant protection was observed in the 
SA- or INA-treated wild-type plants with less than ten percent of plants showing slight 
yellowing. Chlorotic lesions developed in about ninety percent of the untreated wild-type 

25 control plants not pretreated with SA or INA. However, such SA- or INA-induced protection 
was not observed in nprl-1 mutant plants. Chlorotic lesions were clearly seen in over ninety- 
percent of untreated and at least eighty-percent of SA- or INA-treated plants. The symptoms 
on nprl-1 were also more severe than on the wild-type plants. Treatment with only 1 mM 
SA, 0.65 mM INA, or surfactant (0.01 % Silwet-77, used for the bacterial infection) had a 
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minimal effect on both the wild-type and the nprl-1 plants. 

The growth of P.s. maculicola ES4326 was measured in both wild-type and nprl-1 
plants that had been treated with water, SA, or INA two days before P. s. maculicola ES4326 
infection. Leaves were collected 0, 0.5 t 1.0, 2.0, and 3.0 days after bacterial infiltration. For 
5 the untreated wildtype plants, P.s. maculicola ES4326 proliferated 10,000-fold during this 
time period. However, for SA- or INA-treated wild-type plants, the growth of P.s. 
maculicola ES4326 was only about ten-fold, 1000 times lower than the untreated control. A 
Student's / test of the difference between the means at the three-day time point clearly showed 
that growth of the pathogen is inhibited in the wild-type plants treated with SA or INA 

1 0 compared to those sprayed with water (P<0.001 ). Such a dramatic difference in P.s. 

maculicola ES4326 growth, which resulted from SAR protection, was not observed in the 
nprl-1 plants, where a Student's t test showed no statistically difference in growth after three 
days for all conditions (P>0.05); the growth of P.s. maculicola ES4326 in nprl-1 plants was 
similar for mock-treated and either SA- or INA-treated plants. Comparing the untreated 

15 nprl-1 plants with the untreated wild-type, the level of P.s. maculicola ES4326 appeared to 
have reached saturation one day earlier in the mutant than in the wild-type. Moreover, the 
difference in P.s. maculicola ES4326 growth between the SA- or INA-treated wild-type and 
nprl-1 was 500- to 1000-fold. 

To test the response to an avirulent pathogen, the nprl-1 plants were infiltrated with 

20 P.s. maculicola ES4326 carrying an avirulence gene ayrRptl as described by Dong et al. 
(Plant Cell 3:61-72, 1991) and Whalen et al. (Plant Cell 3:49-59, 1991). A typical HR was 
observed in these nprl-1 plants as characterized by the rapid appearance of necrotic lesions, 
detection of autofluorescence in the cell wall regions of the infected cells, and inhibited 
growth of P.s. maculicola ES4326/avrRpt2. The ability of this avirulence gene to induce 

25 SAR in nprl-1 plants was then tested. To distinguish the inducing bacterial strain from the 
challenging strain, the bean pathogen Pseudomonas syringae pv phaseolicola strain NPS3 121 
(P.s. phaseolicola NPS3121; (Lindgren et al., J. Bacteriol 168:512-522, 1986)) containing 
the avrRpt2 gene was used to induce SAR in both the nprl-1 and wild^type plants. P.s. 
phaseolicola NPS3121 by itself caused no disease symptoms or visible HR on Arabidopsis 
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ecotype Columbia, while P.s. phaseolicola NPS3l2l/avrRpt2 elicited a strong HR (Yu et al. 3 
Mol. Plant-Microbe Interact 6:434-443, 1993). Three days after the inoculation, uninfected 
leaves on the same plants were challenged with the virulent pathogen P.s. maculicola 
ES4326, and the growth of P.s. maculicola ES4326 in the plants was measured. A significant 
5 reduction in bacterial growth was observed in the wild-type plants pre-inoculated with P.s. 
phaseolicola NPS3\2l/avrRpt2 compared to the mock treated samples (300- fold); however, 
no difference in P.s. maculicola ES4326 growth was detected in nprl-1 plants. 

Disease Symptoms and BGL2-GUS Ex pression Induced by P.s. maculicola ES4326 
Infection in Wild-Tvpe and nvrl-l 
10 P.s. maculicola ES4326 was able to establish infection in SA-, INA-, and avinilent 

pathogen-treated 

nprl-l plants as well as in the untreated plants. The lesions formed on the untreated mutant 
plants and the untreated wild-type were further compared. For this purpose, the P.s. 
maculicola ES4326 suspension was infiltrated into four-week-old wild-type and nprl-1 

15 leaves. The injection was controlled so that only half of the leaf was infiltrated with the 
bacteria. This could be monitored by the soaking appearance of the half-leaf. Forty-eight 
hours following infiltration, chlorotic lesions were visible on the wild-type leaves. These 
lesions were normally confined to the infiltrated halves of the leaves as defined by the midrib 
vein. Different lesions were observed on the nprl-1 leaves, where the lesions were more 

20 diffuse and often spread into the uninfected halves of the leaves. Sampling of twelve leaves 
from both wild-type and nprl-1 plants revealed significant growth of the bacteria in the 
uninoculated half of eleven nprl-1 leaves compared to none of the wild-type leaves. 

For the leaves infected with P.s, maculicola ES4326, the pattern of BGL2-GUS 
expression was examined by X-Gluc staining. In a wild-type leaf, a high level of GUS 

25 staining was detected in the peripheral region of the lesion. In contrast, no significant GUS 
activity was detected on the nprl-1 leaf, where the lesion was more extensive than on the 
wild-type. 

Conclusions About nprl-1 

The data described above indicates that nprl-1 harbors a trans-acting mutation(s) 
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affecting the response to SA and INA. The possibility of nprl -1 being a mutant affecting the 
uptake of exogenously applied SA or rNA is ruled out by the observation that the expression 
of PR1 induced by P.s. mdculicola ES4326, instead of by exogenously applied SA or INA, is 
also reduced in the nprl -I mutant. The failure of SA or INA to protect the nprl -J mutant 
5 from infection by P.s. maculicola strain ES4326 (in contrast to the protection observed in 
wild-type plants) indicated that the nprl-1 mutation blocks SA or INA induction of 
resistance. Even though the HR elicited in the nprl-1 mutant by bacteria carrying the 
avirulence gene avrRptl was similar to that described previously in wild-type plants (Dong et 
al. t Plant Cell 3:61-72, 1991; Whalen et aL Plant Cell 3:49-59, 1991), the HR-induced SAR 
1 0 protection against infection by the virulent pathogen P.s. maculicola ES4326 was absent in 
the nprl-1 plants. This indicated that nprl-1 is a mutation that prevents the onset of SAR. 
These phenotypes of the nprl-1 mutation indicated that the function of the wild-type NPR1 
gene is to qualitatively and quantitatively regulate the expression of SA- and INA-responsive 
PR genes. 

1 5 Genetic analysis of the progeny of an nprl-1 /nprl-1 XNPRl/NPRl backcross 

indicated that a single recessive nuclear mutation determines the "nonexpresser of PR genes" 
phenotype of the nprl-1 mutant. This also indicated that the NPR1 gene acts as a positive 
regulator of SAR responsive gene induction. While the gene could be a negative regulator 
which is inactivated by SAR induction, a mutation abolishing such regulation would likely be 

20 dominant. Furthermore, the fact that a single mutation (that is, nprl-1) affects the 

responsiveness of this mutant to SA-, INA-, and pathogen induction indicated that SA, INA, 
and pathogens activate a common pathway that leads to the expression of PR genes. 
Identificatio n of the Arabidopsis nor 1-2 and nnrl-3 Mutants 

To identify novel Arabidopsis mutants that negatively affect the induction of SAR, an 
25 alternative mutant screening strategy was employed. 

We have observed that the final density to which the virulent pathogen P.s. 
maculicola ES4326 will grow in an Arabidopsis leaf is directly related to the dose at which 
P.s. maculicola ES4326 was infiltrated. The observed phenotypes of two additional types of 
Arabidopsis mutants also supported this conclusion. Specifically, a series of Arabidopsis 
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mutants were identified that accumulated reduced levels of the phytoalexin called camalexin, 
a phytoalexin that has been found in significant quantities in Arabidopsis (Glazebrook and 
Ausubel, Proc. Natl. Acad. Sci. USA 91 :8955-8959, 1994; Tsuji et al., Plant Physiol 
98:1304-1309, 1992). Importantly, P.s. maculicola ES4326 formed disease lesions and grew 
5 to higher titers on some of these pad (phytoalexin deficient) mutants when inoculated at 
doses below the threshold dose required to give disease symptoms in wild-type plants. 
Similarly, nprl-1 mutants exhibited a similar enhanced susceptibility phenotype as pad 
mutants (Cao et al., Plant Cell 6:1583-1592, 1994). 

Based on these findings that pad and npr mutants were more susceptible to low dose 

1 0 P.s. maculicola ES4326 infection than wild-type plants, a screen was performed to isolate 
additional eds (enhanced disease susceptibility) mutants (Glazebrook et al., Genetics 
143:973-982, 1996). Two leaves of M2 generation mutagenized Arabidopsis plants were 
infected at a dose of strain P.s. maculicola ES4326 at which wild-type plants showed very 
weak symptoms manifested as small chlorotic spots three days after infection, whereas pad 

1 5 and npr J mutants showed large areas of chlorosis. A total of fifteen eds mutants that 
reproducibly allowed at least one half log more growth of P.s. maculicola ES4326 as 
compared to wild-type were identified among 12,500 plants screened. Because somtpad 
mutants as well as nprl-1 mutants have the same enhanced susceptibility phenotype with 
respect to P.s. maculicola ES4326 as the eds mutants (Glazebrook et al., Genetics 143:973- 

20 . 982, 1996), the fifteen eds mutants were tested to determine, whether they synthesized 
wild-type levels of camalexin in response to infection by P.s. maculicola ES4326 {pad 
phenotype) and whether PR] gene expression can be induced by salicylic acid (npr 1-1 
phenotype). The results of these analyses showed that two of the eds mutants exhibited an 
n/?r/-like phenotype. Genetic complementation analysis showed that these two mutations are 

25 allelic to nprl-1. These two mutants were re-named npr 1-2 and npr 1-3. 



Map-Based Positional Cl oning of the Arabidopsis NPR1 Gene 

To map the NPR1 gene, a genetic cross was made between the nprl-1 mutant (present 
in the Columbia ecotype (Col-O) which carried the BGL2-GUS reporter gene) and the wild- 
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type (present in Landsberg erecta ecotype (La-er) which carried the BGL2-GUS reporter 
gene). F3 families from this cross that are homozygous for this mutation at the NPR1 locus 
were identified by their lack of expression of BGL2-GUS when grown on plates containing 
0. 1 mM INA. Expression of the GUS reporter gene was detected by a chromographic assay 
5 of GUS activity using the substrate 5-bromo-4-chloro-3-indoIyl glucuronide according to 
standard techniques (Cao et al., Plant Cell 6:1 583-1 592, 1 994 and Jefferson Plant Mol Biol 
Reporter 5:387-405, 1987). The leaf tissues of these F3 nprl-1 progeny pools (from thirty to 
forty two-week-old seedlings) were collected and frozen in liquid nitrogen. From the frozen 
tissues, genomic DNA preparations were made as described by Dellaporta et al. {Plant Mol. 

1 0 Bioi Reporter 1 : 1 9-2 1 , 1 983) and used to determine the genotypes of various restriction 
fragment length polymorphism (RFLP) and codominant amplified polymorphic sequence 
(CAPS) (Konieczny and Ausubel, Plant 1 4:403-410, 1993) markers. The frequencies of 
recombination between the NPR1 locus and the RFLP and CAPS markers were used to 
determine the position of the NPR1 gene according to conventional methods. 

15 As shown in Fig. 1, the NPRJ gene was mapped to Arabidopsis chromosome I, and 

found to reside between the CAPS marker GAP-B (-22.70 cM on the centromeric side of the 
NPR1 gene) and the RFLP marker m315 (-7.58 cM on the telomeric side of the NPR1 gene). 

To cairy out fine mapping of the NPRJ gene, new CAPS and RFLP markers were 
generated from clones that the genetic maps in the AtDB database (http://genome- 

20 www.stanford.edu/Arabidopsis/) showed were located between GAP-B and m315. Cosmid 
g4026 (CD2-28, Arabidopsis Biological Resource Center, The Ohio State University, 
Columbus, OH) was cut with the restriction enzyme EcoEl and a 4-kb fragment was used to 
identify a polymorphism between Col-0 and La-er after the genomic DNA was digested with 
HindUL Using this RFLP marker, six heterozygotes were detected among the twenty-three 

25 F3 families that were heterozygous at GAP-B. None were found among the seven F3 families 
that were heterozygous at mil 5. Therefore, g4026 is -5.92 cm on the centromeric side of the 
NPR1 gene. Cosmid gll447 (obtained from the collection of Dr. Howard Goodman at the 
Massachusetts General Hospital (Nam et al., Plant Cell 1 :699-705, 1 989)) was used to 
generate a CAPS marker. End-sequences of an 0.8-kb EcoEl fragment were used to design 
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PCR primers (primer i: 5' GTGACAGACTTGCTCCTACTG 3' (SEQ ID N0:15); primer 2: 
5* CAGTGTGTATCAAAGCACCA 3' (SEQ ID NO: 16) which amplified a fragment 
displaying a polymorphism when digested with the EcoRV restriction enzyme. Among the 
436 nprl-1 F3 progeny tested using this newly generated CAPS marker, seventeen 
5 heterozygotes were discovered. Since these heterozygotes were all homozygous Col-0 for the 
GAP-B locus, the g!1447 marker was placed - 1 .95 cM on the telomeric side of the NPR] 
gene. 

There are a number of RFLP markers mapped between g!1447 and g4026. The first 
marker tested was m305 (designated CD\-\\,Arabidopsis Biological Resource Center, the 

1 0 Ohio State University, Columbus, OH (Chang et al., Proa Natl. Acad ScL USA 85:6856- 
6860, 1988)). A 5-kb EcoRl fragment isolated from the m305 lambda clone was further 
subcloned using SaWXbal and the end-sequences of a 1 .6-kb fragment were used to design 
PCR primers (primer 1: 5' TTCTCCAGACCACATGATTAT 3'(SEQ ID NO: 17); primer 2: 
5' TGAAGCTAATATGCACAGGAG 3* (SEQ ID NO: 18)). The resulting PCR fragment 

1 5 amplified using these primers was digested with Haelll to detect a polymorphism. Among 
the 305 nprl l progeny examined using this m305 CAPS marker, no heterozygotes were 
found, indicating that the m305 marker lies extremely close to NPRL 

A partial physical map of chromosome I 

20 (http://cbil.humgen.upeim.edu/--atgc/ATGCUP.htmI) showed a YAC contig that includes . 
m305. The YACs in this contig, as well as left-end-fragments of YAC clones yUP19H6, 
yUP21 A4, and yUPHH9 were obtained from Dr. Joseph Ecker at the University of 
Pennsylvania. The yUP19H6L end-probe was found to detect an Rsal polymorphism, and 
five recombinants were identified among the GAP-B recombinants on the centromeric side of 

25 the NPR J gene (as shown by the vertical arrows in Fig. 1 ). The yUPl 1H9L end-probe was 
found to detect a HindUl polymorphism, and one heterozygote was found among the 
seventeen recombinants for gll447 on the telomeric side of the NPR1 gene (as shown by a 
vertical arrow in Fig. I). Since yUPl 1H9L hybridized with the yUP19H6 YAC clone, these 
results showed that the NPR! gene is located on yUP19H6. In addition to m305 y yUP21 A4L 
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(detects an EcoRl polymorphism) and g8020 (a 1 .3-kb EcoRl fragment that detects a HindlU 
polymorphism) were found to be very closely linked to the NPRJ gene with no recombinants 
identified. m305 7 yUP21 A4L, and g8020 all hybridized to the yUP19H6 YAC clone, further 
supporting the conclusion that yUP19H6 contains the NPRJ gene. 
5 Construction of a Cosmid Library from the YAC Clone VUP19H6 

A genomic DNA preparation was made from the yeast strain containing the YAC 
clone yUP19H6. This DNA was partially digested with the restriction enzyme Taql, size 
selected on a 10-40% sucrose gradient, and cloned into the Cla\ site of the binary vector, 
pCLD04541 (obtained from Dr. Jonathan Jones (Bent et al„ Science 265:1856-1860, 1994)). 
1 0 The pCLD04541 vector is a standard transformation vector used for preparing cosmid 

libraries. This plasmid carries a T-DNA polylinker region, and tetracycline and kanamycin 
resistance markers. 

The cosmid clones were packaged into bacteriophage lambda particles using a 
commercial packaging extract (Gigapack XL, Stratagene, LaJolla, CA) and introduced into E. 

1 5 coli strain DH5a according to the instructions of the supplier. The resulting library was 
found to contain approximately 40,000 independent clones. 
Generation of a Cosmid Co ntip Containing the NPR1 Gene 

The cosmid library generated from the yeast strain containing yUPl 9H6 was plated 
(1,500 cfu/plate) on LB medium agar (containing 5 ng/mL of tetracycline to select for the 

20 presence of pCLD04541 ) and incubated at 37 °C overnight. Colonies were lifted onto 

membranes (GeneScreen, Du Pont, New England Nuclear) and hybridization was carried out 
according to the protocol described by the manufacturer. The library was probed with 5-kb 
EcoRJy 6.5-kb EcdRl/XhoI, and a 1 .3-kb EcoRI fragments prepared from m305, yUP2 1 A4L, 
and g8020> respectively. The colonies that hybridized with these probes were identified and 

25 purified according to conventional methods. Cosmid DNA preparations were made from 

these positive clones using the alkaline lysis method described by Sambrook et al. (Molecular 
Cloning: A Laboratory Manual, Cold Spring Haibor Laboratory Press, New York, 1989), and 
the inserts were analyzed by HindUl restriction digestion and Southern hybridization using 
the probes stated above. The cosmids were found to form a single cosmid contig spanning 
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approximately 80-kb of Arabidopsis DNA. Three of the five recombinants for yUP19HL 
were shown to be heterozygous at an RFLP marker detected by cosmid clone m305-ZA (a 
5-kb HindRl fragment) at the centromeric side of the contig, while the single heterozygotc 
detected by g8020 marker was also detected by the cosmid clone g8020-6-3 (a 1.25-kb 
5 HindUl fragment) at the telomeric side of the contig. This showed that the cosmid contig 
contained the NPR1 gene (Fig. 1). From this contig, fourteen cosmids which each have a 
minimum of 1 0-kb overlap with the neighboring clones (Fig. 1 ) were chosen to transform 
nprJ mutant plants in complementation experiments. 
Complementation of the nprl Mutations 

10 The cosmid clones contained in the E. colt strain DH5a were transferred into the 

Agrobacterium tumefaciens strain GV3101 (pMP90) (Koncz and Schell, MoL Gen. Genet. 
204:383-396, 1986) by conjugation using the helper strain MM294A (pRK2013) (Finan et 
al., J. Bacteriol 167:66-72, 1986). The resulting /I. tumefaciens conjugants were selected 
using 50 ^g/mL kanamycin and 50 iig/mL gentamycin. The A. tumefaciens strains carrying 

15 those fourteen cosmid clones were transformed into npr 1-1 (Cao et al., Plant Cell 

6:1583-1592, 1994) and nprl-2 (Glazebrook et al., Genetics 143:973-982, 1996) using a 
vacuum infiltration method described by Bechtold et al. (C.R. Acad. Sci. Paris, Life Sciences 
316:1194-1199, 1993). The integrity of the cosmid clones in the A. tumefaciens cultures used 
for transformation were examined by Southern analysis. 

20 Transformants of nprl-2 were grown (22 °C in fourteen hours of light) and selected on 

MS medium agar (Murashige and Skoog, Physiol Plant. 15:473-497, 1962) containing 2% 
sucrose, 50 fig/mL kanamycin, and 100 ng/mL ampicillin. Kanamycin-resistant 
transformants which developed true leaves and healthy roots were transplanted to soil. After 
two weeks of growth in soil at 22 °C in fourteen hours of light per day, leaves were collected 

25 from three transformants of each cosmid clone and soaked in 0.5 mM INA solution for 

twenty-four hours at 22 °C in fourteen hours of light per day. Leaf tissues were then collected 
and frozen in liquid nitrogen. Total RNA was extracted from these leaf tissues, and an RNA 
blot was prepared as described by Cao eXzi. (Plant Cell 6:1583-1592, 1994). The blot was 
probed with a P/M-specific probe (a PCR product obtained by amplifying genomic 
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Arabidopsis DNA with PR] -specific primers (sense primer 5 1 
GTAGGTGCTCITGTTCTTCCC3' (SEQ ID NO: 19); anti-sense primer 
5^ACATAATTCCCACGAGGATC3' (SEQ ID NO:20)). 

In control experiments, the wild-type parental line showed the induction of the PR1 
5 gene by INA, while the nprl-2 mutant exhibited no induction oiPR-1 gene expression. 
Nprl-2 transformants containing cosmids (three for each cosmid) 21A4-6-1-1, 21 A4-P5-1, 
21 A4-4-3-1, and 21 A4-2-1 showed strong induction ofPRl by INA, while nprl-2 
transformants containing other clones (for example, M305-2-3, M305-3-9, and 21 A4-3-1) 
displayed no induction. Variations were observed in the intensity of RNA bands among three 

1 0 individual transformants sampled for each cosmid clone. These variations were likely to be 
the result of "position-effects," the effect of the insertion site in the chromosome on the 
expression of the transgene. Cosmid clones 2 1 A4-4-3- 1,21 A4-6- 1 -1 , 2 1 A4-P5-1 , and 
21 A4-2-1 restored the ability of the nprl-2 mutant to respond to INA induction and, 
therefore, complemented the nprl-2 mutation. Examples of INA induced PR1 are shown in 

15 Fig. 2 A. 

Transformants carrying each cosmid were also tested for SA induction of PR I 
expression by RNA blot analysis Examples of SA induction are shown in Figure 2A. The 
wild-type parental line exhibited a high level oiPRl gene induction by SA, whereas the 
nprl-2 mutant exhibited only a minor induction (Fig. 2A). Transformants of the nprl-2 
20 mutant containing cosmids 21 A4-6-1-1, 21 A4-P5-1, 21A4-4-3-1, and 21A4-2-1 showed 
induction of PR1 by SA, while those containing the other clones displayed little induction. 

As shown in Fig. 1, these four clones share a common region of 7.5-kb. 
Transformants of cosmid 21 A4-P4-1 were not available when the experiment described 
above was conducted. However, according to its relative position, it is expected that this 
25 clone can also complement the nprl-2 mutation. 

The same fourteen cosmid clones were also transformed into the nprl-1 mutant. 
Since the nprl-1 mutant carries the BGL2-GUS reporter and the kanamycin resistance gene 
(NPTII), transformants of the cosmid clones could not be selected using kanamycin. Instead, 
transformants that complemented the nprl-1 mutation were selected directly by growing the 
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seeds collected from the nprl-l plants infiltrated with A. tumefaciens on a high concentration 
of SA (0.5 mM). Those plants that developed green leaves were transplanted to another plate 
containing 0. 1 mM 1NA, and GUS activity was measured one week after transplanting. 

To measure GUS activity, seedlings were numbered, and a single leaf was removed 
5 from each plant and placed in a microtiter well containing 100 \iL of GUS substrate 

(4-methylumbelliferyl P-glucuronide) in a solution as described previously (Cao et al., Plant 
Cell 6:1583-1592, 1994; Jefferson, Plant Moi Biol. Reporter 5:387-405, 1987). After an 
overnight incubation at 37 °C, the fluorescent product of GUS activity was examined under a 
long wavelength UV light. As controls, twelve seedlings of the wild-type parental line 

1 0 (BGL2-GUS) were tested, and all showed intense fluorescence after growth on S A and INA. 
Twelve seedlings of the nprl-l mutant (BGL2-GUS) were also included in the experiment, 
and none displayed any increase in fluorescence. From this experiment, nine seedlings 
carrying cosmid 21 A4-P4-1, five carrying 21 A4-P5-1, and six carrying 21 A4-2-1 were found 
to have high levels of fluorescence, i.e., GUS activity, and none of the seedlings from other 

15 cosmid clones were identified through this selection. Direct identification of putative 

complementing transformants in the nprl-l mutant plants by the cosmid clones 21 A4-P4-1 , 
21 A4-P5-1, and 21A4-2-1 as in the transformation experiment using the allelic nprl-2 mutant 
(where all transformants were first selected by kanamycin resistance before identification of 
the transformants that could complement the nprl-2 mutation using RNA blot analysis) 

20 further supported the conclusion from complementation experiments with nprl-2 that the 7.5 
kb region shared by cosmids 21A4-4-3-1, 21 A4-6-1-1, 21A4-P5-1, 21 A4-P4-1, and 21 A4-2-1 
complemented nprl mutations, and that this 7.5-kb region contained the NPR1 gene. 

In addition to reduced PR gene expression, plants with nprl mutations display 
susceptibility to virulent pathogens even after SAR induction. These mutant phenotypes 

25 were also complemented by the cosmids described above. For example, as shown in Figure 
2B, infection by the bacterial pathogen Psm ES4326 caused visible disease symptoms three 
days after infection. While the disease symptoms in the wild-type plants and the 
complemented nprl-l transformants were well-confined to the site of pathogen infiltration 
(the left side of the leaf), the lesions in the nprl-l plants were found to spread beyond the site 

30 of infiltration. In addition, when the dosage of infecting bacteria was reduced 1 0-fold, severe 
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disease symptoms were only observed in the nprl-l mutant (leaves on the right). This 
experiment showed that 21A4-4-3-1 complemented the enhanced susceptibility to Psm 
ES4326 displayed by nprl-L 

The expression of the BGL2-GUS gene was also analyzed in the same leaves after 
5 examination of the disease symptoms (Fig. 2B). Strong GUS expression (blue staining) was 
detected in the marginal regions of the well-confined lesions in the wild-type plants, but was 
absent from the diffuse lesions in the nprl-l plants. Reporter gene expression was restored in 
complemented transform ants. 

In addition to these visual observations, as shown in Fig. 2C, bacterial growth of Psm 

10 ES4326 was measured quantitatively in wild-type, nprl-2, and an nprl-2 transformant with a 
complementing cosmid (21A4-P5-1). Plants were treated with 0.65 mM INA seventy-two 
hours prior to Psm ES4326 infection (OD^ = 0.00 1 ). Infection of Arabidopsis with Psm 
ES4326 was performed according to standard methods (Bowling et al., 1994; supra, Cao et 
al., supra, 1994; Glazebrook et al., supra, 1996). Samples were taken before infection and 

1 5 one, two, and three days after infection. Six to eight samples were taken for each time point 
analyzed and colony-forming units of Psm ES4326 were determined per leaf disc. Complete 
inhibition of Psm ES4326 growth was observed in the wild-type plants following INA 
treatment three days prior to infection, whereas an approximate 10- fold decrease in Psm 
ES4326 growth was observed in the nprl-2 mutant subjected to the same treatment. The 

20 growth of Psm ES4326 was also halted in the complemented transformants after INA 
treatment. Lower bacterial growth (as great at 10 3 -fold) was observed even in the water- 
treated transformants compared to the water-treated wild-type (Fig. 2C) and the water-treated 
transformants carrying noncomplementing cosmids. This enhanced resistance may result 
from the increased NPR1 mRNA levels in these complemented transformants. 

25 A test of resistance to a fungal pathogen, P. parasitica NOCO, was also performed to 

verify complementation of the nprl-l mutation. Infection of Arabidopsis with P. parasitica 
NOCO was performed according to standard methods (Bowling et al., supra, 1994; Cao et 
al., supra, 1994; Glazebrook et al., supra, 1996). INA treatment (0.65 mM) was carried out 
seventy-two hours prior to infection with a spore suspension (3 x 10 4 spores/ 1 mL). Seven 

30 days post-infection, the disease symptoms were scored with respect to the number of 
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conidiophores observed on each plant. A total of twenty to twenty-five plants were examined 
for each genotype with each treatment. Data were analyzed using the Mann-Whitney U-Tests 
(Sokal and Rohlf, supra). As shown in Fig. 2D, the results of these experiments indicated 
that INA-induced resistance to P. parasitica NOCO was restored in the trans formants with 
5 the complementing cosmids. 

Anal yses of the 7.5-kb Region Containing the NPRI G?n e 

The 7.5-kb region identified by the cosmid complementation experiment was further 
analyzed using restriction enzymes. The resulting restriction map from this analysis is shown 
in Fig. 3. Three sets of subclones were made using Hindlll, Xbal, and ClaVXhol digestions 

10 of the cosmid 21 A4-P5-1, which has the 7.5-kb region located in the center of the insert, and 
ligated into the vector pBluescript II SIC (Stratagene, La Jolla, CA). The 7.5-kb region of 
interest was represented by five HindlU subclones with the approximate insert sizes 1 .96-kb, 
1.91-kb, 1.74-kb, 1.25-kb, and 0.50-kb. Subclones with larger inserts (Xbal: ~8.5-kb, -8.5- 
kb, - 1 .45-kb; ClaVXhol: ~ 1 0.0-kb, and ~ 5. 1 -kb) were also made to orient and connect these 

15 Hindlll fragments. 

A Southern blot containing the //wrflll-digested genomic DNA samples from the 
wild-type parental line (BGL2-GUS) and the three nprl mutants was examined with probes 
generated from Hindlll fragments made from the cosmid clone 21 A4-P5-1 . No significant 
difference in the restriction patterns was observed between the wild-type and all three nprl 

20 allelic mutants. Therefore, it is unlikely that these mutants carried a substantial deletion in 
the NPRI gene. 

DNA fragments covering the 7.5-kb region were used to detect transcripts on a blot 
containing the polyA mRNAs made from four-week-old plants of the wild-type parental line 
and of the three nprl allelic mutants seventy-two hours after treatment of the plants with H 2 0 

25 or 0.65 mM INA and 2 mM SA. The polyA mRNA samples were prepared using Dynabeads 
(Dynal, Inc., Lake Success, NY) from seventy-five micrograms of total RNA according to the 
protocol provided by Dynal From this analysis, only one ~2.0-kb mRNA was detected in the 
7.5-kb region using probes made from the 0.5-kb and the adjacent 1 .96-kb Hindlll fragments. 
This mRNA represented a putative transcript of the NPRI gene. In addition, the intensity of 

30 this transcript was about two-fold higher in the DMA/SA-induced samples compared to the 
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H 2 Otreated controls as measured by a Phosphorlmager and ImageQuant (Molecular 
Dynamics, Sunnyvale, CA). Thus, the expression of this transcript believed to represent 
mRNA of the NPR1 gene was induced by INA/SA treatment. No significant difference in the 
pattern of expression was discovered between the wild-type and three nprl mutant alleles on 
5 this polyA RNA blot. 

Se quence Analysis of the NPR1 Gene 

The initial sequencing analysis was carried out using pBluescript SK 1 clones of the 
five Hindlll fragments as templates. The template DNA samples were prepared using Qiagen 
Plasmid Mini Kits (Qiagen Inc., Chatsworth, CA), and 0.6 fig of the template was used for 

10 each sequencing reaction and analyzed by an AB1 automated sequencer. 

M13-20 and M13 reverse primers were used to initiate the sequencing reactions of the 
Hindlll fragments. Various restriction enzymes were then used to generate deletions in these 
Hindlll subclones to analyze sequences more distal to the ends of the fragments. In addition, 
primers were designed to perform primer walking. The relative positions of these Hindlll 

1 5 fragments were determined and gaps between these fragments were filled by sequencing 
analyses using ATwzI-subclones of cosmid 21 A4-P5-1 as templates. The sequence data were 
analyzed to identify restriction enzyme sites, to perform sequence alignment and to search for 
open reading frames using standard DNA analysis software (DNA Strider 1 .1, Mac Vector 
4.0.1, and GeneFinder). Using this software only one putative gene was found. Sequence 

20 data were also compared to the TIGR Arabidopsis thaliana DataBase 

(http://www.tigr.org/tdb/at/at.html). The results of this study identified an expression 
sequence tagged (EST) clone that showed homology with a portion of the 1.96-kb fragment. 
This portion of the 1 .96-kb fragment was also identified as part of the gene recognized using 
GeneFinder software. The nucleotide sequence of the 7.5-kb genomic region encoding the 

25 NPR1 gene product is shown in 
Fig. 4. 

Isolation of NPR1 c DNA Clones 

A cDNA library that was constructed by Dr. Katagiri (and described in detail in 
Mindrinos et al, Cell 78:1089-1099, 1994) was screened using the 1.96-kb Hindlll fragment 
30 as a probe. Bacterial cells (E coli DH10B; GIBCO BRL, Gaithersburg, MD) containing 
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cDNAs made from the aerial parts of one-month old wild-type Arahidopsis plants in vector 
pKEx4tr were plated (60,000 cfti/plate) on LB medium containing 100 \ig/mL ampicillin, and 
the plates were incubated at 37 °C for four and one-half hours. Colonies were lifted onto 
Colony/Plaque Screen membranes (NEN Research Product; Boston, MA), and then the 
5 membranes were placed onto an LB plate, with the colony side up. Both plates were 

incubated at 30°C for twelve hours. The membranes were autoclaved for one minute to lyse 
the cells and fix the DNA to the membrane. Hybridization was performed at 42 °C in a 
solution containing 10% dextran sulfate, 50% formamide, 6X SSC, 5X Denhardt's, and 1% 
SDS; and the membranes were washed twice at 65 °C in 2X SSC and 1% SDS. The positive 

10 colonies were purified through secondary and tertiary screens using identical conditions. One 
positive cloned was subsequently identified and designated pKExNPRl. 

The cDNA inserts were excised from the vector using restriction enzymes EcoRl and 
Sac\. Southern analysis was performed using probes made from the 1 .96-kb (the 3'-end of 
the open reading frame) and the 0.5-kb (the 5-end of the open reading frame) Hindlll 

15 fragments to confirm homology of the cDNA clones. The nucleic acid sequence (SEQ ID 
NO:2) and deduced amino acid sequence (SEQ ID NO:3) of the acquired resistance protein 
termed NPR1 from Arahidopsis thaliana encoded by the 2.1-kb cDNA is shown in Fig. 5. 
Sequence analysis revealed that this cDNA contained sequences corresponding to those 
identified in the EST clone and deduced using the Gene Finder software. 

20. The cDNA sequence was analyzed using the BLAST sequence analysis program. 

This analysis revealed that the NPR1 protein shared significant homology with ankyrin, 
including the region identified as the ankyrin-repeat consensus. In particular, as shown in 
Fig. 6A, the NPR1 sequence contains two regions with significant homology to the 
mammalian ankyrin 3 gene. The sequence identities between NPR1 (amino acids 323-371 

25 and 262-289) and ANK3 (amino acids 740-788 and 3 13-340) are 42% and 35%, respectively, 
and the sequence similarities are 59% and 57%, respectively. This ankyrin-repeat consensus 
has been identified in a diverse array of proteins including transcription factors, cell 
differentiation molecules, structural proteins, and proteins with enzymatic and toxic 
activities. This motif has been shown to function by mediating protein interactions. 

30 Using the consensus sequence defined by Michaely and Bennett (Trends in Cell 
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Biology 2:127-129, 1992) and Bork (Proteins: Structure, Function, and Genetics 17:363-374, 
1993), two additional ankyrin repeats were identified in NPR1\ these are shown in Fig. 6B. 

In addition, using the Mac Vector program, a 1 7 amino acid motif of G-protein 
coupled receptors (MKGTCEFIVTSLEPDRL, Fig. 5, SEQ ID NO:21) has been found in the 
5 NPR1 protein (Science 244:569-572, 1 989). 



The NPR1 -determined Resistance is Do sa ge Dependent 

The ability of NPR-1 to confer disease resistance was evaluated in transgenic plants as 
follows. The NPR1 cDNA sequence (Fig. 5; SEQ ID NO:2) driven by the constitutive 

1 0 CaMV 35S promoter was transformed into Arabidopsis ecotypc Columbia according to 
standard methods. In the resulting T 3 lines homozygous for the 35S-NPR1 transgene, the 
expression of the NPR1 -regulated PR-1 gene, NPR1 mRNA, and NPR1 protein were 
measured to identify those lines exhibiting high (ColNPRlH), medium (ColNPRIM), and 
low (ColNPRIL) levels of NPR1 expression. Table 1 shows the results of evaluating the 

15 relative levels of PR-1, NPR1 mRNA, and NPR1 protein concentrations. 
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Table 1 

Characterization of 35S-NPR1 Tra nsgenic Lines 







PR-1 


NPR1 


NPR1 


5 


Genotype 


(INA) a 


(mRNA) b 


(Protein) 1 




Col 


1.00 


1.00 


1. 00 




Col-Ll 


0.41 


6.92 


0.04 


10 












Col-L2 


0.54 


6.90 


<0.04 




Col-Ml 


1.73 


9.20 


1.40 


15 


Col-M2 


1.80 


9.50 


1.40 




Col-Hl 


2.60 


17.80 


1.60 




Col-H2 


2.74 


27.90 


3.00 



20 



a The relative levels of PR-1 were measured by an RNA blot analysis in the 35S-NPRI transgenic lines grown 
on plates containing 0. 1 mM INA. 

b The relative levels of NPR1 mRNA were measured by a polyA+RNA blot. 
25 c The relative NPR1 protein concentrations were measured by ELISA using NPRI polyclonal antibodies. 

From these experiments, two lines of transformants were identified that had 
significantly lower NPRI protein levels (but not mRNA levels) than the wild-type parent. 

30 This, however, was not unexpected because overexpression of a transgene in plants often 
leads to co-suppression of the transgene as well as the corresponding endogenous gene 
(Baulcombe, The Plant Cell. 8:1833, 1996). 

The high-, medium-, and low-expressing 35S-NPR1 transgenic lines were next 
subjected to infection by the bacterial pathogen Pseudomonas syrinigae pv maculicola 

35 ES4326 and the fungal pathogen Peronospora parasitica NOC02 according to standard 
methods. The results of these experiments are shown in Figs. 8A and 8B, respectively. In 
the absence of SAR induction, the high- and the medium-expressing 35S-NPR1 transgenic 
lines showed significantly increased resistance to both bacterial and fungal pathogens while 
the low-expressing transgenic lines displayed reduced tolerance to the pathogens as compared 
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to the wild-type. Together, these results showed that NPR1 was a positive regulator of SAR, 
and that the NPR1 -determined resistance was dosage dependent; overexpression of the NPR1 
protein enhanced resistance whereas underexpression led to reduced tolerance to infection. 
NPR1 is Translocated to the Nucleus Upon SA Induction 
5 To elucidate the induction mechanism and the molecular function of the protein, the 

subcellular localization of NPR1 was determined by using standard reporter gene fusion 
construct analysis. The green fluorescent protein (GFP) gene was fused to the carboxyl end 
of the NPR1 cDNA driven by the constitutive CaMV 35S promoter, and the 35S-NPR1-GFP 
construct was used to transform nprl mutants, nprl-l and nprl-2 t according to standard 

10 methods. In the resulting transgenic lines, the NPR1 -GFP transgene was found to 

complement all the nprl mutant phenotypes; namely, the lack of SA- or INA-induced PR 
gene expression, the reduced tolerance to exogenous SA, and the lack of SA- or INA- 
induced resistance to pathogens (Figs. 9A-9C). Transgenic lines expressing the GFP alone 
(designated 35S~mGFP), exhibited no complementing activity (Fig. 9B). In addition, the 

1 5 presence of the NPR-GFP transgene was found to restore both inducible BGL-GUS 

expression and resistance to P. parasitica as shown in Figs. 9A and 9C, respectively. These 
experiments therefore showed that the NPR1-GFP was biologically active and that the 
subcellular localization of NPR1-GFP should reflect that of the endogenous NPR1 protein. 
To examine the subcellular localization of the NPR1 protein, the 35S-NPR1-GFP and 

20 35S-mGFP transgenic lines were grown in MS medium in the presence or absence of the 

SAR-inducing chemicals SA or INA. Eleven-day-old seedlings were subsequently examined 
using confocal microscopy to detect localization of NPR1-GFP and mGFP. As shown in Fig. 
10, the 35S-NPR1-GFP seedlings grown on MS showed low levels of GFP throughout the 
mesophyll cells and strong GFP fluorescence in the nuclei of the guard cells. Upon induction 

25 by SA or INA, NPR1-GFP was detected exclusively in the nuclei of both the mesophyll cells 
and the guard cells. In the 35S-mGFP transformants, green fluorescence was detected in the 
cytoplasm as well as in the nuclei, and S A and INA treatments had no effect on the 
localization of the protein. These results indicated that NPR1 was localized in the cytoplasm 
in the mesophyll cells, and that upon induction the NPR1 protein was transported into the 

30 nucleus resulting in PR1 gene expression and resistance. In the guard cells, the NPR1 protein 
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was localized in the nuclei even without an SAR induction, an intriguing observation because 
constitutive activation of defense mechanisms in these cells may be necessary to fend off 
microbial pathogens from gaining entry into the plant through stomata. Since mGFP alone 
showed no induced nuclear translocation, the nuclear transport of the NPR1-GFP fusion must 
5 be directed by a signal in NPR1. Consistent with this, the following two potential nuclear 
localization sequences (NLS's) were found in NPR1 : 

252 RRKELGLEVPKVKK 265 (SEQ ID NO:22); and 

541 KKQRYMEIQETLKK 554 (SEQ ID NO:23). 

Significantly, nuclear translocation in tissues infected by the virulent pathogen Psm 
1 0 ES4326 was also observed (Fig. 1 1 A). This pattern of induction was also observed to 
coincide with the pattern of PR gene expression observed in plants after infection 
(Fig. 11B). 

Characterization of nnr Mutations 

To further characterize the NPR1 gene, the mutations in nprl-1, nprl-2, nprl-3, and 

15 nprl-4 were identified by DNA sequencing. The mutant nprl-4 is a new nprl allele that was 
identified in the Col-0 (BGL2-GUS) background based on its enhanced susceptibility to Psm 
ES4326. Each mutant allele was found to contain a single base-pair change. The nprl-1, 
nprl -2, nprl -3, and nprl-4 alleles respectively altered the highly conserved histidine (residue 
334) in the third ankyrin-repeat consensus to a tyrosine, changed a cysteine (residue 150) to a 

20 tyrosine, introduced a nonsense codon (residue 400) that should result in a truncated protein 
lacking 194 amino acids of the C-terminal end of the protein, and destroyed the acceptor site 
of the third intron junction. All of these point mutations are GC to AT transitions, consistent 
with the mode of action of the mutagen, ethyl-methanesulfonate (EMS), used for the 
generation of these mutations. 

25 Genetic Ana lysis of the Plant Defense Response Using Arabidovsis thaliana 

Although biochemical studies have played an important role in elucidating the general 
features of the plant defense response, the complexity of the defense response limits the 
utility of biochemical analysis in determining the importance of particular defense responses 
or enzymes in conferring resistance to pathogens. Isolation of plant defense-response 

30 mutants not only helps elucidate the roles of known pathogen-induced responses in 
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combating particular pathogens, but also facilitates the identification of plant defense 
mechanisms not already correlated with a known biochemical or molecular genetic response. 
With the development of well-characterized hostpathogen systems involving the model plant 
Arabidopsis thaliana as the host as described herein, comprehensive genetic analysis of 
5 acquired resistance responses is made possible. 

All of the major features of the plant defense response that have been observed in crop 
plants have also been observed in Arabidopsis-pathogsn interactions. For example, several 
resistance gene-a vr gene interactions have been identified for both bacterial and fungal 
pathogens of Arabidopsis (Bisgrove et al., Plant Cell 6:927-933, 1994; Holub et al., Mol. 

10 Plant-Microbe Interact. 7:223-239, 1994; Kunkel et al., Plant Cell 5:865-875, 1993; Yu et 
al., Mol. Plant-Microbe Interact. 6:434-443, 1993). Moreover, all of the important features 
of SAR have been observed in Arabidopsis (Uknes et al., Plant Cell 4:645-656, 1992; Uknes 
et al., MoL Plant-Microbe Interact. 6:692-698, 1993). Importantly, the power of Arabidopsis 
genetic analysis has recently been used to help identify a variety of components of the 

15 Arabidopsis defense response to pathogen attack (Bent et al, Science 265: 1 856-1 860, 1 994; 
Bowling et al., Plant Cell 6: 1845-1 857, 1994; Cao et al., Plant Cell 6:1583-1592, 1994; 
Century et al., Proc. Natl Acad. Sci. USA 92:6597-6601, 1995; Delaney et al., Proc. Natl 
Acad, ScL USA 92:6602-6606, 1995; Dietrich et al., Cell 77:565-577, 1994; Glazebrook and 
Ausubel, Proc. Natl Acad. Sci. USA 91:8955-8959, 1994; Glazebrook et al., Genetics 

20 143:973-982, 1996; Grant et al., Science 269:843-846, 1995; Greenberg and Ausubcl, Plant 
J. 4:327-341, 1993; Greenberg et al., Plant J. 4:327-341, 1994; Mindrinos et al., Cell 
78:1089-1099, 1994). Thus, the results described herein provide the basis for identifying 
genes that are involved in acquired disease resistance throughout the plant kingdom and are 
not limited to Arabidopsis. 

25 Isolation of Solanaceou s AR Genes 

Using the Arabidopsis NPRI cDNA sequence shown in Fig. 5 (SEQ ID NO:2), the 
isolation of AR homologs that are found in solanaceous plants (e.g., potato, eggplant, tomato, 
tobacco, petunia, and pepper) is readily accomplished using standard techniques. 

For example, a Nicotiana glutinosa cDN A library was screened for the presence of an 

30 NPRI homolog. The library was constructed in the lambda ZAP II vector from poly 
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( A+)RNA isolated from Nicotiana glutinosa plants infected with tobacco mosaic virus 
(TMV) (Whitham et al., CelllS: 1101-1115, 1994). Bacteriophage were plated on NZY 
media using XL-1 Blue host cells. Approximately 10 6 plaques were screened by transferring 
the phage DNA onto positively charged nylon membrane (GeneScreen; DuPont-New 
5 England Nuclear) and probing with a random primed 32 P labeled probe that was prepared 
using the full-length Arabidopsis NPR1 cDNA as the template. Hybridization was performed 
at 37 °C in 40% formamide, 5X SSC, 5X Denhardt, 1% SDS, and 10% dextran sulfate. The 
filters were washed in 2X SSC for fifteen minutes at room temperature and 2X SSC, 1% SDS 
for thirty minutes at 37 °C. 

10 Two hybridizing clones were identified and purified. The pBluescript plasmids were 

excised using XL-1 Blue host cells and R408 helper phage. Restriction enzyme analysis 
indicated that the two positive clones contained inserts of approximately 3600 bp and 2100 
bp. Restriction digests and sequence analysis indicated that the 3600 bp insert represented 
two independent cDNAs of 2100 bp and 1500 bp and that the two independently isolated 

15 2100 bp cDNAs were identical. Both strands of the 2 1 00 bp cDNA were sequenced using 
35 S-dATP and the Sequenase sequencing kit (U.S. Biochemicals, Cleveland, OH). The 
nucleotide and amino acid sequences encoding the Nicotiana glutinosa NPR1 homolog are 
shown in Fig. 7A (SEQ ID NO: 13) and Fig. 7B (SEQ ID NO: 14), respectively. 
Isolation of Other Acquired Resistance Genes 

20 Any plant cell can serve as the nucleic acid source for the molecular cloning of an AR 

gene. Isolation of an AR gene involves the isolation of those DNA sequences which encode a 
protein exhibiting AR-associated structures, properties, or activities, for example, an ankyrin- 
repeat motif and the ability to induce gene expression of PR proteins that limit pathogen 
infection. Based on the AR genes and polypeptides described herein, the isolation of 

25 additional plant AR coding sequences is made possible using standard strategies and 
techniques that are well known in the art 

In one particular example, the AR sequences described herein may be used, together 
with conventional screening methods of nucleic acid hybridization screening. Such 
hybridization techniques and screening procedures are well known to those skilled in the art 

30 and are described, for example, in Benton and Davis, Science 196: 1 80, 1977; Grunstein and 
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Hogness, Proc. Natl. Acad. Sci., USA 72:3961, 1975; Ausubel et al. (supra)\ Berger and 
Kimmel (supra); and Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratory Press, New York. In one particular example, all or part of the 
NPR1 cDNA (described herein) may be used as a probe to screen a recombinant plant DNA 
5 library for genes having sequence identity to the AR gene. Hybridizing sequences are 
detected by plaque or colony hybridization according to the methods described below. 

Alternatively, using all or a portion of the amino acid sequence of the AR 
polypeptide, one may readily design AR-specific oligonucleotide probes, including AR 
degenerate oligonucleotide probes (i.e., a mixture of all possible coding sequences for a given 

10 amino acid sequence). These oligonucleotides may be based upon the sequence of either 

DNA strand and any appropriate portion of the AR sequence (Figs. 4 and 5, 7A, and 7B SEQ 
ID NOS:l t 2 t 3, 13, and 14, respectively). General methods for designing and preparing such 
probes are provided, for example, in Ausubel et al., 1996, Current Protocols in Molecular 
Biology, Wiley lnterscience, New York, and Berger and Kimmel, Guide to Molecular 

15 Cloning Techniques, 1987, Academic Press, New York. These oligonucleotides are useful 
for AR gene isolation, either through their use as probes capable of hybridizing to AR 
complementary sequences or as primers for various amplification techniques, for example, 
polymerase chain reaction (PCR) cloning strategies. If desired, a combination of different 
oligonucleotide probes may be used for the screening of a recombinant DNA library. The 

20 oligonucleotides may be detectably-labeled using methods known in the art and used to probe 
filter replicas from a recombinant DNA library. Recombinant DNA libraries are prepared 
according to methods well known in the art, for example, as described in Ausubel et al. 
(supra), or they may be obtained from commercial sources. 

In one particular example of this approach, related AR sequences having greater than 

25 80% identity are detected or isolated using high stringency conditions. High stringency 

conditions may include hybridization at about 42 °C and about 50% formamide, 0.1 mg/mL 
sheared salmon sperm DNA, 1% SDS, 2X SSC, 10% Dextran sulfate, a first wash at about 
65°C t about 2X SSC, and 1% SDS, followed by a second wash at about 65 °C and about 0. IX 
SSC. Alternatively, high stringency conditions may include hybridization at about 42 °C and 

30 about 50% formamide, 0.1 mg/mL sheared salmon sperm DNA, 0.5% SDS, 5X SSPE, IX 
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Denhardt's, followed by two washes at room temperature and 2X SSC, 0.1% SDS, and two 
washes at between 55-60°C and 0.2X SSC 0.1% SDS. 

In another approach, low stringency hybridization conditions for detecting AR genes 
having about 40% or greater sequence identity to the AR genes described herein include, for 
5 example, hybridization at about 42 °C and 0.1 mg/mL sheared salmon sperm DNA, 1% SDS, 
2X SSC, and 10% Dextran sulfate (in the absence of formamide), and a wash at about 37 °C 
and 6X SSC, about 1% SDS. Alternatively, the low stringency hybridization may be carried 
out at about 42 °C and 40% formamide, 0. 1 mg/mL sheared salmon sperm DNA, 0.5% SDS, 
5X SSPE, IX Dcnhardt's, followed by two washes at room temperature and 2X SSC, 0.1% 
10 SDS and two washes at room temperature and 0.5X SSC, 0.1% SDS. These stringency 

conditions are exemplary; other appropriate conditions may be determined by those skilled in 
the art. 

If desired, RNA gel blot analysis of total or poly(A+) RNAs isolated from any plant 
(e.g., those crop plants described herein) may be used to determine the presence or absence of 

15 an AR transcript using conventional methods. As an example, a Northern blot of potato RNA 
was prepared according to standard methods and probed with a 1.96-kb NPR1 Hindlll 
fragment in a hybridization solution containing 50% formamide, 5X SSC, 2.5X Denhardt's 
solution, and 300 jig/mL salmon sperm DNA at 37°C Following overnight hybridization, 
the blot was washed two times for ten minutes each in a solution containing IX SSC, 0.2% 

20 SDS at 37 °C. An autoradiogram of the blot demonstrated the presence an MPtfy-hybridizing 
RNA in the potato RNA sample, indicating that this solanaceous crop plant encoded an 
acquired resistance gene. These results further indicate that AR genes are not restricted to the 
crucifer Arabidopsis. Isolation of this hybridizing transcript is performed using standard 
cDNA cloning techniques. 

25 As discussed above, AR oligonucleotides may also be used as primers in 

amplification cloning strategies, for example, using PCR. PCR methods are well known in 
the art and are described, for example, in PCR Technology, Erlich, ed., Stockton Press, 
London, 1989; PCR Protocols: A Guide to Methods and Applications, Innis et al., eds., 
Academic Press, Inc., New York, 1990; and Ausubel et al. (supra). Primers are optionally 

30 designed to allow cloning of the amplified product into a suitable vector, for example, by 
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including appropriate restriction sites at the 5* and 3' ends of the amplified fragment (as 
described herein). If desired, AR sequences may be isolated using the PCR "RACE" 
technique, or Rapid Amplification of cDNA Ends (see, e.g., Innis et al. (supra)). By this 
method, oligonucleotide primers based on an AR sequence are oriented in the 3' and 5' 
5 directions and are used to generate overlapping PCR fragments. These overlapping 3 1 - and 
5'-end RACE products are combined to produce an intact full-length cDNA. This method is 
described in Innis et al. (supra); and Frohman et al., Proc, Natl. Acad. ScL USA 85:8998, 
1988. Exemplary oligonucleotide primers useful for amplifying AR gene sequences include, 
without limitation: 

1 0 A. AA( A/G)G A( A/G)GA(T/C)CA(T/C)ACNA A (SEQ ID NO:24); 

B. TA(T/C)TG(T/C)AA(T/C)GTNAA(A/G)AC (SEQ ID NO:25); 

C. GCCATNGTNGC(T/C)TG(T/C)TT (SEQ ID NO:26); 

D. AA( A/G)GTN AA( A/G) AA( A/G)C A(C/T)GT (SEQ ID NO:27); 

E. (A/G)AA(Cn , )TC(A/G)CANGTNCC(C/T)TTCAT (SEQ ID NO:28). 
1 5 For each of the above sequences, N is A, T, G or C. 

Alternatively, any plant cDNA or cDNA expression library may be screened by 
functional complementation of an npr mutant (for example, the nprl mutant described 
herein) according to standard methods described herein. 

Confirmation of a sequenced relatedness to the AR polypeptide family may be 
20 accomplished by a variety of conventional methods including, but not limited to, functional 
complementation assays and sequence comparison of the gene and its expressed product. In 
addition, the activity of the gene product may be evaluated according to any of the techniques 
described herein, for example, the functional or immunological properties of its encoded 
product. 

25 Once an AR sequence is identified, it is cloned according to standard methods and 

used for the construction of plant expression vectors as described below. 
AR Polypeptide Expression 

AR polypeptides may be expressed and produced by transformation of a suitable host 
cell with all or part of an AR cDN A (for example, the cDN A described above) in a suitable 

30 expression vehicle or with a plasmid construct engineered for increasing the expression of an 
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AR polypeptide (supra) in vivo. 

Those skilled in the field of molecular biology will understand that any of a wide 
variety of expression systems may be used to provide the recombinant protein. The precise 
host cell used is not critical to the invention. The AR protein may be produced in a 
5 prokaryotic host, for example, E. coli, or in a eukaryotic host, for example, Saccharomyces 
cerevisiae^ mammalian cells (for example, COS 1 or NIH 3T3 cells), or any of a number of 
plant cells or whole plant including, without limitation, algae, tree species, ornamental 
species, temperate fruit species, tropical fruit species, vegetable species, legume species, 
crucifer species, monocots, dicots, or in any plant of commercial or agricultural significance. 

1 0 Particular examples of suitable plant hosts include, but are not limited to, conifers, petunia, 
tomato, potato, pepper, tobacco, Arabidopsis^ lettuce, sunflower, oilseed rape, flax, cotton, 
sugarbeet, celery, soybean, alfalfa, Medicago* lotus, Vigna, cucumber, carrot, eggplant, 
cauliflower, horseradish, morning glory, poplar, walnut, apple, grape, asparagus, cassava, 
rice, maize, millet, onion, barley, orchard grass, oat, rye, and wheat. 

1 5 Such cells are available from a wide range of sources including the American Type 

Culture Collection (Rockland, MD); or from any of a number seed companies, for example, 
W. Atlee Burpee Seed Co. (Warminster, PA), Park Seed Co. (Greenwood, SC), Johnny Seed 
Co. (Albion, ME), or Northrup King Seeds (Harstville, SC). Descriptions and sources of 
useful host cells are also found in Vasil LK., Cell Culture and Somatic Cell Genetics of 

20 _ Plants, Vol I, II, III Laboratory Procedures and Their Applications Academic Press, New 
York, 1984; Dixon, R.A., Plant Cell Culture-A Practical Approach, IRL Press, Oxford 
University, 1985; Green et al., Plant Tissue and Cell Culture, Academic Press, New York, 
1987; and Gasser and Fraley, Science 244:1293, 1989. 

For prokaryotic expression, DNA encoding an AR polypeptide is carried on a vector 

25 operably linked to control signals capable of effecting expression in the prokaryotic host. If 
desired, the coding sequence may contain, at its 5' end, a sequence encoding any of the 
known signal sequences capable of effecting secretion of the expressed protein into the 
periplasmic space of the host cell, thereby facilitating recovery of the protein and subsequent 
purification. Prokaryotes most frequently used are various strains of E. coli\ however, other 

30 microbial strains may also be used. Plasmid vectors are used which contain replication 
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origins, selectable markers, and control sequences derived from a species compatible with the 
microbial host. Examples of such vectors are found in Pouwels et al. (supra) or Ausubel et 
al. (supra). Commonly used prokaryotic control sequences (also referred to as "regulatory 
elements") are defined herein to include promoters for transcription initiation, optionally with 
5 an operator, along with ribosome binding site sequences. Promoters commonly used to direct 
protein expression include the beta-lactamase (penicillinase), the lactose (lac) (Chang et al., 
Nature 198:1056, 1977), the tryptophan (Trp) (Goeddel et al., Nucl. Acids Res. 8:4057, 
1980), and the tac promoter systems, as well as the lambda-derived P, promoter and N-gene 
ribosome binding site (Simatake et al., Nature 292:128, 1981). 

10 One particular bacterial expression system for AR polypeptide production is the E. 

coli pET expression system (Novagen, Inc., Madison, WI). According to this expression 
system, DNA encoding an AR polypeptide is inserted into a pET vector in an orientation 
designed to allow expression. Since the AR gene is under the control of the T7 regulatory 
signals, expression of AR is induced by inducing the expression of T7 RNA polymerase in 

1 5 the host cell. This is typically achieved using host strains which express T7 RNA polymerase 
in response to IPTG induction. Once produced, recombinant AR polypeptide is then isolated 
according to standard methods known in the art, for example, those described herein. 

Another bacterial expression system for AR polypeptide production is the pGEX 
expression system (Pharmacia). This system employs a GST gene fusion system which is 

-20 designed for high-level expression of genes or gene fragments as fusion proteins with rapid 
purification and recovery of functional gene products. The protein of interest is fused to the 
carboxyl terminus of the glutathione S-transferase protein from Schistosoma japonicum and 
is readily purified from bacterial lysates by affinity chromatography using Glutathione 
Sepharose 4B. Fusion proteins can be recovered under mild conditions by elution with 

25 glutathione. Cleavage of the glutathione S-transferase domain from the fusion protein is 
facilitated by the presence of recognition sites for site-specific proteases upstream of this 
domain. For example, proteins expressed in pGEX-2T plasmids may be cleaved with 
thrombin; those expressed in pGEX-3X may be cleaved with factor Xa. 

For eukaryotic expression, the method of transformation or transfection and the 

30 choice of vehicle for expression of the AR polypeptide will depend on the host system 
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selected. Transformation and transfection methods are described, e.g., in Ausubel et al. 
(supra); Weissbach and Weissbach, Methods for Plant Molecular Biology, Academic Press, 
1989; Gelvin ct al., Plant Molecular Biology Manual, Kluwer Academic Publishers, 1990; 
Kindle, K., Proa Natl. Acad. ScL, U.S.A. 87:1228, 1990; Potrykus, L, Annu. Rev. Plant 
5 Physiol. Plant Mol. Biology 42:205, 1991 ; and BioRad (Hercules, CA) Technical Bulletin 
#1687 (Biolistic Particle Delivery Systems). Expression vehicles may be chosen from those 
provided, e.g., in Cloning Vectors: A Laboratory Manual (P.H. Pouwels et al., 1985, Supp. 
1987); Gasser and Fraley (supra); Clontech Molecular Biology Catalog (Catalog 1992/93 
Tools for the Molecular Biologist, Palo Alto, CA); and the references cited above. Other 

10 expression constructs are described by Fraley et al. (U.S. Pat. No. 5,352,605). 
Construction of Plant Transgenes 

Most preferably, an AR polypeptide is produced by a stably-transfected plant cell line, 
a transiently-transfected plant cell line, or by a transgenic plant. A number of vectors suitable 
for stable or extrachromosomal transfection of plant cells or for the establishment of 

1 5 transgenic plants are available to the public; such vectors are described in Pouwels et al. 
(supra), Weissbach and Weissbach (supra), and Gelvin et al. (supra). Methods for 
constructing such cell lines are described in, e.g., Weissbach and Weissbach (supra), and 
Gelvin et al. (supra). 

Typically, plant expression vectors include (1) a cloned plant gene under the 

20 transcriptional control of 5 1 and 3' regulatory sequences and (2) a dominant selectable marker. 
Such plant expression vectors may also contain, if desired, a promoter regulatory region (for 
example, one conferring inducible or constitutive, pathogen- or wound-induced, 
environmentally- or developmental ly-regulated, or cell- or tissue-specific expression), a 
transcription initiation start site, a ribosome binding site, an RNA processing signal, a 

25 transcription termination site, and/or a polyadenylation signal. 

Once the desired AR nucleic acid sequence is obtained as described above, it may be 
manipulated in a variety of ways known in the art For example, where the sequence involves 
non-coding flanking regions, the flanking regions may be subjected to mutagenesis. 

The AR DNA sequence of the invention may, if desired, be combined with other 

30 DNA sequences in a variety of ways. The AR DNA sequence of the invention may be 
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employed with all or part of the gene sequences normally associated with the AR protein. In 
its component parts, a DNA sequence encoding an AR protein is combined in a DNA 
construct having a transcription initiation control region capable of promoting transcription 
and translation in a host cell. 
5 In general, the constructs will involve regulatory regions functional in plants which 

provide for modified production of AR protein as discussed herein. The open reading frame 
coding for the AR protein or functional fragment thereof will be joined at its 5 ! end to a 
transcription initiation regulatory region such as the sequence naturally found in the 5' 
upstream region of the AR structural gene. Numerous other transcription initiation regions 

10 are available which provide for constitutive or inducible regulation. 

For applications where developmental, cell, tissue, hormonal, or environmental 
expression is desired, appropriate 5' upstream non-coding regions are obtained from other 
genes, for example, from genes regulated during meristem development, seed development, 
embryo development, or leaf development. 

1 5 Regulatory transcript termination regions may also be provided in DNA constructs of 

this invention as well. Transcript termination regions may be provided by the DNA sequence 
encoding the AR protein or any convenient transcription termination region derived from a 
different gene source. The transcript termination region will contain preferably at least 1-3 
kb of sequence 3' to the structural gene from which the termination region is derived. Plant 

20 expression constructs having AR as the DNA sequence of interest for expression (in either 
the sense or antisense orientation) may be employed with a wide variety of plant life, 
particularly plant life involved in the production of storage reserves (for example, those 
involving carbon and nitrogen metabolism). Such genetically-engineered plants are useful 
for a variety of industrial and agricultural applications as discussed infra. Importantly, this 

25 invention is applicable to dicotyledons and monocotyledons, and will be readily applicable to 
any new or improved transformation or regeneration method. 

The expression constructs include at least one promoter operably linked to at least one 
AR gene. An example of a useful plant promoter according to the invention is a caulimovirus 
promoter, for example, a cauliflower mosaic virus (CaMV) promoter. These promoters 

30 confer high levels of expression in most plant tissues, and the activity of these promoters is 
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not dependent on virally encoded proteins. CaMV is a source for both the 35S and 19S 
promoters. Examples of plant expression constructs using these promoters are found in 
Fraley et al., U.S. Pat. No. 5,352,605. In most tissues of transgenic plants, the CaMV 35S 
promoter is a strong promoter (see, e.g., Odell et al., Nature 313:810, 1 985). The CaMV 
5 promoter is also highly active in monocots (see, e.g., Dekeyscr ct al., Plant Cell 2:591 , 1990; 
Terada and Shimamoto, MoL Gen. Genet. 220:389, 1990). Moreover, activity of this 
promoter can be further increased (i.e., between 2-10 fold) by duplication of the CaMV 35S 
promoter (see e.g., Kay et al., Science 236:1299, 1987; Ow et al., Proc. Natl. Acad. ScL, 
U.S.A. 84:4870, 1987; and Fang et al., Plant Cell 1:141, 1989, and McPherson and Kay, U.S. 

10 Pat. No. 5,378,142). 

Other useful plant promoters include, without limitation, the nopaline synthase (NOS) 
promoter (An et al., Plant Physiol. 88:547, 1988 and Rodgers and Fraley, U.S. Pat. No. 
5,034,322), the octopine synthase promoter (Fromm et al., Plant Cell 1:977, 1989), figwort 
mosiac virus (FMV) promoter (Rodgers, U.S. Pat. No. 5,378,619), and the rice actin 

1 5 promoter (Wu and McElroy, W09 1 /09948). 

Exemplary monocot promoters include, without limitation, commelina yellow mottle 
virus promoter, sugar cane badna virus promoter, rice tungro bacilli form virus promoter, 
maize streak vims element, and wheat dwarf virus promoter. 

For certain applications, it may be desirable to produce the AR gene product in an 

20 appropriate tissue, at an appropriate level, or at an appropriate developmental time. For this 
purpose, there are an assortment of gene promoters, each with its own distinct characteristics 
embodied in its regulatory sequences, shown to be regulated in response to inducible signals 
such as the environment, hormones, and/or developmental cues. These include, without 
limitation, gene promoters that are responsible for heat-regulated gene expression (see, e.g., 

25 Callis et al., Plant Physiol. 88:965, 1988; Takahashi and Komeda, MoL Gen. Genet. 219:365, 
1989; and Takahashi et al. Plant J. 2:751, 1992), light-regulated gene expression (e.g., the 
pea rbcSSA described by Kuhlemeier et al., Plant Cell 1 :47 1 , 1989; the maize rbcS promoter 
described by SchSf&er and Sheen, Plant Cell 3:997, 1991; the chlorophyll a/b-binding 
protein gene found in pea described by Simpson et al, EMBOJ. 4:2723, 1985; the Arabssu 

30 promoter, or the rice rbs promoter), hormone-regulated gene expression (for example, the 
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abscisic acid (ABA) responsive sequences from the Em gene of wheat described by Marcotte 
et al., Plant Cell 1 :969, 1989; the ABA-inducible HVA1 and HVA22, and rd29A promoters 
described for barley and Arabidopsis by Straub et al.. Plant Cell 6:617, 1994 and Shen et al., 
Plant Cell 7:295, 1995; and wound-induced gene expression (for example, of wunl described 
5 by Siebertz et al., Plant Cell 1 :961, 1989), organ-specific gene expression (for example, of 
the tuber-specific storage protein gene described by Roshal et al., EMBOJ. 6:1 155, 1987; the 
23-kDa zein gene from maize described by Schemthaner et al., EMBO J. 7: 1 249, 1 988; or the 
French bean B-phaseolin gene described by Bustos et al. t Plant Cell 1 :839, 1989), or 
pathogen-inducible promoters (for example, PR-1, prp-1, or p-1,3 glucanase promoters, the 

1 0 fungal-inducible wirla promoter of wheat, and the nematode-inducible promoters, TobRB7- 
5 A and Hrng-1, of tobacco and parsley, respectively). 

Plant expression vectors may also optionally include RNA processing signals, e.g, 
introns, which have been shown to be important for efficient RNA synthesis and 
accumulation (Callis et al., Genes and Dev. 1:1183, 1987). The location of the RNA splice 

1 5 sequences can dramatically influence the level of transgene expression in plants. In view of 
this fact, an intron may be positioned upstream or downstream of an AR polypeptide- 
encoding sequence in the transgene to modulate levels of gene expression. 

In addition to the aforementioned 5' regulatory control sequences, the expression 
vectors may also include regulatory control regions which are generally present in the 3' 

20 regions of plant genes (Thornburg et al., Proc. NatL Acad. ScL U.S.A. 84:744, 1987; An et 
al., Plant Cell 1:115, 1989). For example, the 3' terminator region may be included in the 
expression vector to increase stability of the mRNA. One such terminator region may be 
derived from the PI-II terminator region of potato. In addition, other commonly used 
terminators are derived from the octopine or nopaline synthase signals. 

25 The plant expression vector also typically contains a dominant selectable marker gene 

used to identify those cells that have become transformed. Useful selectable genes for plant 
systems include genes encoding antibiotic resistance genes, for example, those encoding 
resistance to hygromycin, kanamycin, bleomycin, G418, streptomycin, or spectinomycin. 
Genes required for photosynthesis may also be used as selectable markers in photosynthetic- 

30 deficient strains. Finally, genes encoding herbicide resistance may be used as selectable 
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al., Plant Cell Physiol. 25:1353, 1984), (6) electroporation protocols (see, e.g., Gelvin et al., 
supra] Dekeyser et al., supra; Fromm et al, Nature 319:791, 1986; Sheen Plant Cell 2:1027, 
1 990; or Jang and Sheen Plant Cell 6:1665, 1994), and (7) the vortcxing method (see, e.g., 
Kindle supra). The method of transformation is not critical to the invention. Any method 
5 which provides for efficient transformation may be employed. As newer methods are 

available to transform crops or other host cells, they may be directly applied. Suitable plants 
for use in the practice of the invention include, but are not limited to, sugar cane, wheal, rice, 
maize, sugar beet, potato, barley, manioc, sweet potato, soybean, sorghum, cassava, banana, 
grape, oats, tomato, millet, coconut, orange, rye, cabbage, apple, watermelon, canola, cotton, 

10 carrot, garlic, onion, pepper, strawberry, yam, peanut, onion, bean, pea, mango, citrus plants, 
walnuts, and sunflower. 

The following is an example outlining one particular technique, an Agrobacterium- 
mediated plant transformation. By this technique, the general process for manipulating genes 
to be transferred into the genome of plant cells is carried out in two phases. First, cloning and 

1 5 DN A modification steps are carried out in £. coli, and the plasmid containing the gene 
construct of interest is transferred by conjugation or electroporation into Agrobacterium. 
Second, the resulting Agrobacterium strain is used to transform plant cells. Thus, for the 
generalized plant expression vector, the plasmid contains an origin of replication that allows 
it to replicate in Agrobacterium and a high copy number origin of replication functional in £. 

20 colL This permits facile production and testing of transgenes in E. coli prior to transfer to 
Agrobacterium for subsequent introduction into plants. Resistance genes can be carried on 
the vector, one for selection in bacteria, for example, streptomycin, and another that will 
function in plants, for example, a gene encoding kanamycin resistance or herbicide resistance. 
Also present on the vector arc restriction endonuc lease sites for the addition of one or more 

25 transgenes and directional T-DNA bonier sequences which, when recognized by the transfer 
functions of Agrobacterium, delimit the DNA region that will be transferred to the plant. 

In another example, plant cells may be transformed by shooting into the cell tungsten 
microprojectiles on which cloned DNA is precipitated. In the Biolistic Apparatus (Bio-Rad) 
used for the shooting, a gunpowder charge (22 caliber Power Piston Tool Charge) or an air- 

30 driven blast drives a plastic macroprojectile through a gun barrel. An aliquot of a suspension 
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markers; useful herbicide resistance genes include the bar gene encoding the enzyme 
phosphinothricin acetyltransferase and conferring resistance to the broad spectrum herbicide 
Basta® (Hoechst AG, Frankfurt, Germany). 

Efficient use of selectable markers is facilitated by a determination of the 
5 susceptibility of a plant cell to a particular selectable agent and a determination of the 
concentration of this agent which effectively kills most, if not all, of the transformed cells. 
Some useful concentrations of antibiotics for tobacco transformation include, e.g., 75-100 
Hg/mL (kanamycin), 20-50 ng/mL (hygromycin), or 5-10 ng/mL (bleomycin). A useful 
strategy for selection of transformants for herbicide resistance is described, e.g., by Vasil et 
10 al., supra. 

In addition, if desired, the plant expression construct may contain a modified or fully- 
synthetic structural AR coding sequence which has been changed to enhance the performance 
of the gene in plants. Methods for constructing such a modified or synthetic gene are 
described in Fischoff and Perlak, U.S. Pat. No. 5,500,365. 

1 5 It should be readily apparent to one skilled in the art of molecular biology, especially 

in the field of plant molecular biology, that the level of gene expression is dependent, not 
only on the combination of promoters, RNA processing signals, and terminator elements, but 
also on how these elements are used to increase the levels of selectable marker gene 
expression. 

20 Plant Transformation 

Upon construction of the plant expression vector, several standard methods are 
available for introduction of the vector into a plant host, thereby generating a transgenic 
plant. These methods include (1) Agrobacterium-mediated transformation (A tumefaciens or 
A. rhizogenes) (see, e.g., Lichtenstein and Fuller In: Genetic Engineering, vol 6, PWJ Rigby, 

25 ed, London, Academic Press, 1987; and Lichtenstein, CP., and Draper, J,. In: DNA Cloning, 
Vol II, D.M Glover, ed, Oxford, IRI Press, 1985)), (2) the particle delivery system (see, e.g., 
Gordon-Kamm et al., Plant Cell 2:603 (1990); or BioRad Technical Bulletin 1687, supra), 
(3) microinjection protocols (see, e.g., Green et al., supra), (4) polyethylene glycol (PEG) 
procedures (see, e.g., Draper et al., Plant Cell Physiol. 23:451, 1982; or e.g., Zhang and Wu, 

30 Theor. Appl. GeneL 76:835, 1988), (5) liposome-mediated DNA uptake (see, e.g., Freeman et 
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of tungsten particles on which DNA has been precipitated is placed on the front of the plastic 
macroprojectile. The latter is fired at an acrylic stopping plate that has a hole through it that 
is too small for the macroprojectile to pass through. As a result, the plastic macroprojectile 
smashes against the stopping plate, and the tungsten microprojectiles continue toward their 
5 target through the hole in the plate. For the instant invention the target can be any plant cell, 
tissue, seed, or embryo. The DNA introduced into the cell on the microprojectiles becomes 
integrated into either the nucleus or the chloroplast. 

In general, transfer and expression of transgenes in plant cells are now routine 
practices to those skilled in the art, and have become major tools to carry out gene expression 
1 0 studies in plants and to produce improved plant varieties of agricultural or commercial 
interest. 

Transgenic Plant Regeneration 

Plant cells transformed with a plant expression vector can be regenerated, for 
example, from single cells, callus tissue, or leaf discs according to standard plant tissue 
1 5 culture techniques. It is well known in the art that various cells, tissues, and organs from 
almost any plant can be successfully cultured to regenerate an entire plant; such techniques 
are described, e.g., in Vasil supra; Green et al., supra, Weissbach and Weissbach, supra; and 
Gelvin et al., supra. 

In one particular example, a cloned AR polypeptide construct under the control of the 
20 35S CaMV promoter and the nopaline synthase terminator and carrying a selectable marker 
(for example, kanamycin resistance) is transformed into Agrobacterium. Transformation of 
leaf discs (for example, of tobacco or potato leaf discs), with vector-containing 
Agrobacterium is carried out as described by Horsch et al. (Science 227:1229, 1985). 
Putative transformants are selected after a few weeks (for example, 3 to 5 weeks) on plant 
25 tissue culture media containing kanamycin (e.g. 100 ng/mL). Kanamycin-resistant shoots are 
then placed on plant tissue culture media without hormones for root initiation. Kanamycin- 
resistant plants are then selected for greenhouse growth. If desired, seeds from self-fertilized 
transgenic plants can then be sowed in a soil-less medium and grown in a greenhouse. 
Kanamycin-resistant progeny are selected by sowing surfaced sterilized seeds on hormone- 
30 free kanamycin-containing media. Analysis for the integration of the transgene is 
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accomplished by standard techniques (see, for example, Ausubel et al. supra; Gelvin et al. 
supra). 

Transgenic plants expressing the selectable marker are then screened for transmission 
of the transgene DNA by standard immunoblot and DNA detection techniques. Each positive 
5 transgenic plant and its transgenic progeny are unique in comparison to other transgenic 
plants established with the same transgene. Integration of the transgene DNA into the plant 
genomic DNA is in most cases random, and the site of integration can profoundly affect the 
levels and the tissue and developmental patterns of transgene expression. Consequently, a 
number of transgenic lines are usually screened for each transgene to identify and select 

10 plants with the most appropriate expression profiles. 

Transgenic lines are evaluated for levels of transgene expression. Expression at the 
RNA level is determined initially to identify and quantitate expression-positive plants. 
Standard techniques for RNA analysis are employed and include PCR amplification assays 
using oligonucleotide primers designed to amplify only transgene RNA templates and 

15 solution hybridization assays using transgene-specific probes (see, e.g., Ausubel et al., 
supra). The RNA-positive plants are then analyzed for protein expression by Western 
immunoblot analysis using AR specific antibodies (see, e.g., Ausubel et al., supra). In 
addition, in situ hybridization and immunocytochemistry according to standard protocols can 
be done using transgene-specific nucleotide probes and antibodies, respectively, to localize 

20 sites of expression within transgenic tissue. 

Ectopic expression of AR genes is useful for the production of transgenic plants 
having an increased level of resistance to disease-causing pathogens. 

In addition, if desired, once the recombinant AR protein is expressed in any cell or in 
a transgenic plant (for example, as described above), it may be isolated, e.g., using affinity 

25 chromatography. In one example, an anti-AR polypeptide antibody (e.g., produced as 

described in Ausubel et al., supra, or by any standard technique) may be attached to a column 
and used to isolate the polypeptide. Lysis and fractionation of AR-producing cells prior to 
affinity chromatography may be performed by standard methods (see, e.g., Ausubel et al., 
supra). Once isolated, the recombinant protein can, if desired, be further purified, for 

30 example, by high performance liquid chromatography (see, e.g., Fisher, Laboratory 
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Techniques In Biochemistry And Molecular Biology, eds., Work and Burdon, Elsevier, 1980). 

Polypeptides of the invention, particularly short AR protein fragments, can also be 
produced by chemical synthesis (e.g., by the methods described in Solid Phase Peptide 
Synthesis, 2nd ed., 1984 The Pierce Chemical Co., Rockford, IL). These general techniques 
5 of polypeptide expression and purification can also be used to produce and isolate useful AR 
fragments or analogs. 

Ectopic Expression of AR Genes for Engineering Plant Defense Responses to Pathogens 

As discussed above, plasmid constructs designed for the expression of AR gene 
products are useful, for example, for activating plant defense pathways that confer anti- 

10 pathogenic properties to a transgenic plant. AR genes that are isolated from a host plant (e.g.,. 
Arabidopsis or Nicotiana) may be engineered for expression in the same plant, a closely 
related species, or a distantly related plant species. For example, the cruciferous Arabidopsis 
NPR1 gene may be engineered for constitutive low level expression and then transformed 
into an Arabidopsis host plant. Alternatively, the Arabidopsis NPRJ gene may be engineered 

1 5 for expression in other cruciferous plants, such as the Brassicas (for example, broccoli, 

cabbage, and cauliflower). Similarly, the NPR1 homolog of Nicotiana glutinosa is useful for 
expression in related solanaceous plants, such as tomato, potato, and pepper. To achieve 
pathogen resistance, it is important to express an AR protein at an effective level. Evaluation 
of the level of pathogen protection conferred to a plant by ectopic expression of an AR gene 

20 is determined according to conventional methods and assays. _ 
In one working example, constitutive ectopic expression of the NPRJ gene of 
Arabidopsis (Fig. 5; SEQ ID NO:2) or the NPRI homolog of Nicotiana glutinosa 
(Fig. 7 A; SEQ ID NO: 13) in Russet Burbank potato is used to control Phytophthora infestans 
infection. In one particular example, a plant expression vector is constructed that contains an 

25 NPRI cDN A sequence expressed under the control of the enhanced CaMV 35S promoter as 
described by McPherson and Kay (U.S. Patent 5,359,142). This expression vector is then 
used to transform Russet Burbank according to the methods described in Fischhoff et al. 
(U.S. Patent 5,500,365). To assess resistance to fungal infection, transformed Russet 
Burbank and appropriate controls are grown to approximately eight-weeks-old, and leaves 

30 (for example, the second or third from the top of the plant) are inoculated with a mycelial 
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suspension of P. infestans. Plugs of P. infestans mycelia are inoculated on each side of the 
leaf mid vein. Plants are subsequently incubated in a growth chamber at 27 °C with constant 
fluorescent light. 

Leaves of transformed Russet Burbank and control plants are then evaluated for 
5 resistance to P. infestans infection according to conventional experimental methods. For this 
evaluation, the number of lesions per leaf and percentage of leaf area infected are recorded 
every twenty-four hours for seven days after inoculation. From these data, levels of 
resistance to P. infestans are determined. Transformed potato plants that express an NPR1 
gene having an increased level of resistance to P. infestans relative to control plants are taken 

10 as being useful in the invention. 

Alternatively, to assess resistance at the whole plant level, transformed and control 
plants are transplanted to potting soil containing an inoculum of P. infestans. Plants are then 
evaluated for symptoms of fungal infection (for example, wilting or decayed leaves) over a 
period of time lasting from several days to weeks. Again, transformed potato plants 

1 5 expressing the NPRI gene having an increased level of resistance to the fungal pathogen, P. 
infestans, relative to control plants are taken as being useful in the invention. 

In another working example, expression of the NPRI homolog of Nicotiana glutinosa 
in tomato is used to control bacterial infection, for example, to Pseudomonas syringae. 
Specifically, a plant expression vector is constructed that contains the cDNA sequence of the 

20 NPRI homolog from Nicotiana glutinosa (Fig. 7 A; SEQ ID NO:l 3) which is expressed 

under the control of the enhanced CaMV 35S promoter as described by McPherson and Kay, 
supra. This expression vector is then used to transform tomato plants according to the 
methods described in FischhofF et al., supra. To assess resistance to bacterial infection, 
transformed tomato plants and appropriate controls are grown, and their leaves are inoculated 

25 with a suspension of P. syringae according to standard methods, for example, those described 
herein. Plants are subsequently incubated in a growth chamber, and the inoculated leaves are 
subsequently analyzed for signs of disease resistance according to standard methods. For 
example, the number of chl orotic lesions per leaf and percentage of leaf area infected are 
recorded and evaluated after inoculation. From a statistical analysis of these data, levels of 

30 resistance to P. syringae are determined. Transformed tomato plants that express an NPRI 
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homolog of Nicotiana glutinosa gene having an increased level of resistance to P. syringae 
relative to control plants are taken as being useful in the invention. 

In still another working example, expression of the NPR1 homolog of rice is used to 
control fungal diseases, for example, the infection of tissue by Magnaporthe grisea, the cause 
5 of rice blast. In one particular approach, a plant expression vector is constructed that contains 
the cDNA sequence of the rice NPR1 homolog that is constitutively expressed under the 
control of the rice actin promoter described by Wu et al. (WO 91/09948). This expression 
vector is then used to transform rice plants according to conventional methods, for example, 
using the methods described in Hiei et al (Plant Journal 6:271-282, 1994). To assess 

1 0 resistance to fungal infection, transformed rice plants and appropriate controls are grown, and 
their leaves are inoculated with a mycelial suspension of M. grisea according to standard 
methods. Plants are subsequently incubated in a growth chamber and the inoculated leaves 
are subsequently analyzed for disease resistance according to standard methods. For 
example, the number of lesions per leaf and percentage of leaf area infected are recorded and 

1 5 evaluated after inoculation. From a statistical analysis of these data, levels of resistance to M. 
grisea are determined. Transformed rice plants that express a rice NPR1 homolog having an 
increased level of resistance to M. grisea relative to control plants are taken as being useful in 
the invention. 

.20 AR Interacting Polypeptides 

The isolation of AR sequences also facilitates the identification of polypeptides which 
interact with the AR protein. Such polypeptide-encoding sequences are isolated by any 
standard two hybrid system (see, for example, Fields et al., Nature 340:245-246, 1989; Yang 
et aL, Science 257:680-682, 1992; Zervos et al, Cell 72:223-232, 1993). For example, all or 

25 a part of the AR sequence may be fused to a DNA binding domain (such as the GAM or 
LexA DNA binding domain). After establishing that this fusion protein does not itself 
activate expression of a reporter gene (for example, a lacZ or LEU2 reporter gene) bearing 
appropriate DNA binding sites, this fusion protein is used as an interaction target. Candidate 
interacting proteins fused to an activation domain (for example, an acidic activation domain) 

30 are then co-expressed with the AR fusion in host cells, and interacting proteins are identified 
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by their ability to contact the AR sequence and stimulate reporter gene expression. AR- 
interacting proteins identified using this screening method provide good candidates for 
proteins that are involved in the acquired resistance signal transduction pathway. 
Antibodies 

5 AR polypeptides described herein (or imunogenic fragments or analogs) may be used 

to raise antibodies useful in the invention; such polypeptides may be produced by 
recombinant or peptide synthetic techniques (see, e.g., Solid Phase Peptide Synthesis, 2nd 
ed., 1984, Pierce Chemical Co., Rockford, IL; Ausubel et al., supra). The peptides may be 
coupled to a carrier protein, such as KLH as described in Ausubel et al, supra. The KLH- 

1 0 peptide is mixed with Freund's adjuvant and injected into guinea pigs, rats, or preferably 
rabbits. Antibodies may be purified by peptide antigen affinity chromatography. 

Monoclonal antibodies may be prepared using the AR polypeptides described above 
and standard hybridoma technology (see, e.g., Kohler et al., Nature 256:495, 1975; Kohler et 
al., Eur. J. Immunol 6:511, 1976; Kohler et al., Eur. J. Immunol. 6:292, 1976; Hammerling et 

1 5 al., In Monoclonal Antibodies and T Cell Hyhridomas, Elsevier, NY, 1981; Ausubel et al., 
supra). 

Once produced, polyclonal or monoclonal antibodies are tested for specific AR 
recognition by Western blot or immunoprecipitation analysis (by the methods described in 
Ausubel et al., supra). Antibodies which specifically recognize AR polypeptides are 
20 considered to be useful in the invention; such antibodies may be used, e.g., in an 
immunoassay to monitor the level of AR polypeptide produced by a plant. 
Use 

The invention described herein is useful for a variety of agricultural and commercial 
purposes including, but not limited to, improving acquired resistance against plant pathogens, 
25 increasing crop yields, improving crop and ornamental quality, and reducing agricultural 
production costs. In particular, ectopic expression of an AR gene in a plant cell provides 
acquired resistance to plant pathogens and can be used to protect plants from pathogen 
infestation that reduces plant productivity and viability. 

The invention also provides for broad-spectrum pathogen resistance by facilitating the 
30 natural mechanism of host resistance. For example, AR transgenes can be expressed in plant 
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cells at sufficiently high levels to initiate an acquired resistance plant defense response 
constitutively in the absence of signals from the pathogen. The level of expression associated 
with such a plant defense response may be determined by measuring the levels of defense 
response gene expression as described herein or according to any conventional method. If 
5 desired, the AR transgenes are expressed by a controllable promoter such as a tissue-specific 
promoter, cell-type specific promoter, or by a promoter that is induced by an external signal 
or agent such as a pathogen- or wound-inducible control element, thus limiting the temporal 
or tissue expression or both of an acquired resistance defense response. The AR genes may 
also be expressed in roots, leaves, or fruits, or at a site of a plant that is susceptible to 

1 0 pathogen penetration and infection. 

The invention is also useful for controlling plant disease by enhancing a plant's SAR 
defense mechanisms. In particular, the invention is useful for combating diseases known to 
be inhibited by plant SAR defense mechanisms. These include, without limitation, viral 
diseases caused by TMV and TNV, bacterial diseases caused by Pseudomonas and 

1 5 Xanthomonas, and fungal diseases caused by Erysiphe, Peronospora, Phytophthora, 

Colletotrichum, and Magnaporthe grisea. In particular exemplary approaches, constitutive or 
inducible expression of an AR gene in a transgenic plant is useful for controlling powdery 
mildew of wheat caused by Erysiphe, bacterial leaf spot of pepper caused by Xanthomonas 
campestris, bacterial wilt and bacterial spot of tomato caused by Pseudomonas syringae and 

20 Xanthomonas campestris, and bacterial blights of citrus and walnut caused by Xanthomonas 
campestris. 

Other Embodiments 
The invention further includes analogs of any naturally-occurring plant AR 
25 polypeptide. AnaJogs can differ from the naturally-occurring AR protein by amino acid 
sequence differences, by post-translational modifications, or by both. Analogs of the 
invention will generally exhibit at least 40%, more preferably 50%, and most preferably 60% 
or even having 70%, 80%, or 90% identity with all or part of a naturally-occurring plant AR 
amino acid sequence. The length of sequence comparison is at least 15 amino acid residues, 
30 preferably at least 25 amino acid residues, and more preferably more than 35 amino acid 
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residues. Modifications include in vivo and in vitro chemical derivatization of polypeptides, 
e.g., acetylation, carboxylation, phosphorylation, or glycosylation; such modifications may 
occur during polypeptide synthesis or processing or following treatment with isolated 
modifying enzymes. Analogs can also differ from the naturally-occurring AR polypeptide by 
5 alterations in primary sequence. These include genetic variants, both natural and induced (for 
example, resulting from random mutagenesis by irradiation or exposure to ethyl 
methylsulfate or by site-specific mutagenesis as described in Sambrook, Fritsch and Maniatis, 
Molecular Cloning: A Laboratory Manual (2d ed.), CSH Press, 1989, or Ausubel et al„ 
supra). Also included are cyclized peptides, molecules, and analogs which contain residues 

10 other than L-amino acids, e.g., D-amino acids or non-naturally occurring or synthetic amino 
acids, e.g., P or y amino acids. 

In addition to full-length polypeptides, the invention also includes AR polypeptide 
fragments. As used herein, the term "fragment," means at least 20 contiguous amino acids, 
preferably at least 30 contiguous amino acids, more preferably at least 50 contiguous amino 

1 5 acids, and most preferably at least 60 to 80 or more contiguous amino acids. Fragments of 
AR polypeptides can be generated by methods known to those skilled in the art or may result 
from normal protein processing (e.g., removal of amino acids from the nascent polypeptide 
that are not required for biological activity or removal of amino acids by alternative mRNA 
splicing or alternative protein processing events). In preferred embodiments, an AR 

20 polypeptide fragment includes an ankyrin-repeat motif as described herein. In other preferred 
embodiments, an AR fragment is capable of interacting with a second polypeptide component 
of the AR signal transduction cascade. 

Furthermore, the invention includes nucleotide sequences that facilitate specific 
detection of an AR nucleic acid. Thus, AR sequences described herein or portions thereof 

25 may be used as probes to hybridize to nucleotide sequences from other plants (e.g., dicots, 
monocots, gymnosperms, and algae) by standard hybridization techniques under conventional 
conditions. Sequences that hybridize to an AR coding sequence or its complement and that 
encode an AR polypeptide are considered useful in the invention. As used herein, the term 
"fragment," as applied to nucleic acid sequences, means at least 5 contiguous nucleotides, 

30 preferably at least 10 contiguous nucleotides, more preferably at least 20 to 30 contiguous 
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nucleotides, and mosl preferably at least 40 to 80 or more contiguous nucleotides. Fragments 
of AR nucleic acid sequences can be generated by methods known to those skilled in the art. 

5 

Deposit 

Cosmids 21A4-2-1, 21A4-4-3-1, 21A4-P5-1 have been deposited with the American 
Type Culture Collection on July 8, 1 996, and bear the accession numbers ATCC No. 97649, 
97650, and 9765 1 . Plasmid pKExNPRl was deposited on July 31, 1996 and bears the 

1 0 accession number ATCC No. 9767 1 . Applicants acknowledge their responsibility to replace 
these plasmids should it loose viability before the end of the term of a patent issued hereon, 
and their responsibility to notify the American Type Culture Collection of the issuance of 
such a patent, at which time the deposit will be made available to the public. Prior to that 
time the deposit will be made available to the Commissioner of Patents under terms of 37 

1 5 CFR §1.14 and 35 USC § 1 1 2. These deposits are available as required by foreign patent 
laws in countries wherein counterparts of this subject application, or progeny, are filed. It 
should be understood that availability of a deposit does not constitute a license to practice the 
subject invention. 



20 



All publications and patent applications mentioned in this specification are herein 
incorporated by reference to the same extent as if each independent publication or patent 
application was specifically and individually indicated to be incorporated by reference. 



WO 98/06748 



PCTAJS97/13994 



-64- 



SEQUENCE LISTING 

( 1 ) GENERAL INFORMATION 

(i) APPLICANT: The General Hospital Corporation et al. 

(ii) TITLE OF THE INVENTION: 

ACQUIRED RESISTANCE GENES AND USES THEREOF 

(iii) NUMBER OF SEQUENCES: 28 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Clark & Elbing LLP 

(B) STREET: 176 Federal Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US97/ — 

(B) FILING DATE: 08-AUG-97 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/023,851 

(B) FILING DATE: August 9, 1996 

(A) APPLICATION NUMBER: 60/035,166 

(B) FILING DATE: January 10, 1997 

(A) APPLICATION NUMBER: 60/046,769 

(B) FILING DATE: May 16, 1997 
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(viii) ATTORNEY/AGENT INFORMATION : 

(A) NAME: Elbing, Karen L 

(B) REGISTRATION NUMBER: 35,238 

(C) REFERENCE/DOCKET NUMBER: 00786/3 39 W04 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 61 7-428-0200 

(B) TELEFAX: 617-428-7045 



(2) INFORMATION FOR SEQ ID NO: 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7548 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 : 

AAGCTTGTGA TGCAAGTCAT GGGATATTGC TTTGTGTTAA GTATACAA AA CCATCACGTG 60 

GATACATAGT CTTCA A ACCA ACCACTAAAC AGTATCAGGT CATACCA AAG CCAG AAGTG A 1 20 

AGGGTTGGGA TATGTCATTG GGTTTAGCGG TAATCGGATT GA ACCCTTTC CGGTATAA AA 1 80 

TACAAAGGCT TTCGCAGTCT CGGCGTATGT GTATGTCTCG GGGTATCTAC CATTTGAATC 240 

ACAGAACTTT TATGTGCGAA GTTTTCGATT CTGATTCGTT TACCTGGAAG AGATTAGAAA 300 

TTTGCGTCTA CCAAA A ACAG ACAGATTAAT TTTTTCCAAC CCG ATACA AG TTTCGGGGTT 360 

CTTGC ATTGG ATATCACGG A ACA ACA ATGT GATCCGGTTT TGTCTCA A A A CCG A AACTTG 420 

GTCCTTCTTC CATACTCCGA ACTCTGATGT TTTCTCAGGA TTAGTCAGAT ACG AAGGGAA 480 

GCTAGGTGCT ATTCGTCAGT GGACAAACAA AGATCAAGAA GATGTTCACG AGTTATGGGT 540 

TTTAA AGAGC AGTTTTGAAA AGTCGTGGGT TAA AGTGAAA G ATATTA AAA GCATTGGAGT 600 

AGATTTGATT ACGTGGACTC CAAGCAACGA CGTTGTATTG TTTCGTAGTA GTGATCGTGG 660 

TTGCCTCTAC AACATAA ACG CAGAGAAGTT G AATTTAGTT TATGCAAAA A AAGAGGGATC 720 

TG ATTGTTCT TTCGTTTGTT TTCCGTTTTG TTCTG ATTAC G AG AGGGTTG ATCTG AACGG 780 

AAGAAGCAAC GGGCCGACAC TTTAAAAAAA AAATAAAAAA AATGGGCCGA CAAATGCAAA 840 

CGTAGTTGAC A AGGATCTCA AGTCTCA AGT CTCAATTGGC TCGCTCATTG TGGGGCATAA 900 

ATATATCTAG TGATGTTTAA TTGTTTTTTA TAAGGTAAAA AGGAATATTG AATTTTGTTT 960 

CTTAGGTTTA TGTAATAATA OCA AACATTG TTTTATG AAT ATTTA ATCTG ATTTTTTGGC 1 020 

TAGTTATTTT ATTATATCAA GGGTTCCTGT TTATAGTTGA A AACAGTTAC TGTATAGAAA 1 080 

ATAGTGTCCC A ATTTTCTCT CTT AA ATA AT ATATTAGTTA ATAAA AGATA TTTTA ATATA 1 1 40 

TTAGATATAC A ATA ATATCT AAAGCAACAC ATATTTAGAC ACAACACGTA ATATCTTACT 1 200 

ATTGTTTACA TATATTTATA GCTTACCA AT ATAACCCGTA TCTATGTTTT ATAAGCTTTT 1 260 

ATACAATATA TGTACGGTAT GCTGTCCACG TATATATATT CTCCAAA AAA A ACGCATGGT 1 320 

ACACAAAATTTATTAAATAT TTGGCAATTG GGTGTTTATC TAAAGTTTAT CACAATATTT 1380 

ATCAACTATA ATAGATGGTA GAAGATAAAA AAATTATATC AGATTGATTC A ATTAAATTT 1 440 

TATAATATAT CATTTTAAAA AATTAATTAA AAGAAAACTA TTTCATAAAA TTGTTCAAA A 1 500 

GATAATTAGT A AAATTAATT AAATATGTGA TGCTATTGA A TTATAG AG AG TTATTGTAAA 1 560 

7TTACTTAAA ATCATACA A A TCTTATCCTA ATTTAACTTA TCATTTAAG A A ATACAAAAG \ 620 

TAA AAA ACGC GG AA AGCAAT A ATTTATTTA CCTTATTATA ACTCCTATAT AAAGTACTCT 1 680 

GTTTATTCAA CATAATCTTA CGTTGTTGTA TTCATAGGCA TCTTTAACCT ATCTTTTCAT 1 740 
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TTTCTC ATCT CGATCGTTTT CGATCCAACA AAATGAGTCT ACCGGTGAGG AACCA AGAGG 1 800 

TGATTATGCA GATTCCTTCT TCTTCTCAGT TTCCAGCA AC ATCG AGTCCG GAA AACACCA 1 860 

ATCA AGTG A A GG ATGAGCC A A ATTTGTTTA G ACGTGTTAT G A ATTTGCTT TTACGTCGTA 1 920 

GTTATTGAAA AAGCTGATTT ATCGCATGAT TCAGAACGAG AAGTTGAAGG CAAATAACTA 1980 

AAGAAGTCTT TTATATGTAT ACAATAATTG TTTTTAAATC AAATCCTA AT TAAAAAAATA 2040 

TATTC ATTAT G ACTTTCATG TTTTTAATGT A ATTTATTCC TATATCTATA ATGATTTTTG 2 1 00 

TTGTG A AG AG CGTTTTCATT TGCTATAGA A CA AGG AG A AT AGTTCCAGGA A ATATTCG AC 2 1 60 

TTGATTTAAT TATAGTGTA A ACATGCTGA A CACTG AAAAT TACTTTTTCA ATAAACGA AA 2220 

A ATATAATAT ACATTACAA A ACTTATGTGA ATAAAGCATG AGACTTAATA TACGTTCCCT 2280 

TTATCATTTT ACTTCAAAGA AAATAAACAG AAATGTAACT TTCACATGTA AATCTAATTC 2340 

TTAAATTTA A AAA ATA AT AT TTATATATTT ATATGAAAAT AACG AACCGG ATGAA AA ATA 2400 

AATTTTATAT ATTTATATCA TCTCCAAATC TAGTTTGGTT CAGGGGCTTA CCGAACCGGA 2460 

TTGAACTTCT CATATACAAA AATTAGCAAC ACAAAATGTC TCCGGTATA A ATACTAACAT 2520 

TTATA ACCCG AACCGGTTTA GCTTCCTGTT ATATCTTTTT AAAAAAGATC TCTGACAAAG 2580 

ATTCCTTTCC TGGA AATTTA CCGGTTTTGG TGA AATGTAA ACCGTGGG AC GAGGATGCTT 2640 

CTTCATATCT CACCACCACT CTCGTTGACT GG ACTTGGCT CTGCTCGTCA ATGGTTATCT 2700 

TCGATCITA A ACCAAATCCA GTTGATAAGG TCTCTTCGTT GATTAGCAGA GATCTCTTTA 2760 

ATTTGTGAAT TTCAATTCAT CGGAACCTGT TGATGGACAC CACCATTGAT GGATTCGCCG 2820 

ATTCTTATGA AATCAGCAGC ACTAGTTTCG TCGCTACCGA TA AC ACCG AC TCCTCTATTG 2880 

TTTATCTGGC CGCCGAACAA GTACTCACCG GACCTGATGT ATCTGCrCTG CAATTGCTCT 2940 

CCA ACAGCTT CGAATCCGTC TTTGACTCGC CGGATGATTT CTACAGCGAC GCTA AGCTTG 3000 

1TCTCTCCGA CGGCCGGGAA GTTTCTTTCC ACCGGTGCGT TTTGTCAGCG AGA AGCTCTT 3060 

TCTTCA AG AG CGCTTTAGCC GCCGCTA AGA AGGAG AA AG A CTCCA ACAAC ACCGCCGCCG 3 1 20 

TG A AGCTCGA GCTTAAGGAG ATTGCCA AGG ATTACGA AGT CGGTTTCGAT TCGGTTGTGA 3 1 80 

CTGTTTTGGC TTATGTTTAC AGCAGCAGAG TGAGACCGCC GCCTAAAGGA GTTTCTGAAT 3240 

GCGCAGACGA GAATTGCTGC CACGTGGCTT GCCGGCCGGC GGTGGATTTC ATGTTGGAGG 3300 

TTCTCTATTT GGCTTTCATC TTCAAGATCC CFGAATTAAT TACTCTCTAT CAGGTAAAAC 3360 

ACCATCTGCA TTAAGCTATG GTTACACATT CATGAATATG TTCTTACTTG AGTACTTGTA 3420 

TTTGTATTTC AGAGGCACTT ATTGGACGTT GTAGACAAAG TTGTTATAGA GG ACACATTG 3480 

GTTATACTCA AGCTTGCTAA TATATGTGGT AAAGCTTGTA TGAAGCTATT GG ATAGATGT 3540 

AAAGAGATTA TTGTCA AGTC TAATGTAGAT ATGGTTAGTC TTGAAAAGTC ATTGCCGGAA 3600 

GAGCTTGTTA A AGAGATA AT TGATAGACGT A A AG AGCTTG GTTTGGAGGT ACCTA AAGTA 3660 

AAGAAACATG TCTCGAATGT ACATAAGGCA CTTGACTCGG ATGATATTGA GTTAGTCAAG 3720 

TTGCTTTTGA AAGAGGATCA CACCAATCTA GATGATGCGT GTGCTCTTCA TTTCGCTGTT 3780 

GCATATTGCA ATGTGA AGAC CGCAACAGAT CTTTTAAAAC TTGATCTTGC CGATGTCAAC 3840 

CATAGGAATC CGAGGGGATA TACGGTGCTT CATGTTGCTG CGATGCGGAA GGAGCCACAA 3900 

TTGATACTAT CTCTATTGGA AAAAGGTGCA AGTGCATCAG AAGCAACTTT GGAAGGTAGA 3960 

ACCGCACTCA TGATCGCAAA ACAAGCCACT ATGGCGGTTG AATGTAATAA TATCCCGGAG 4020 

CAATGCAAGC ATTCTCTCAA AGGCCGACTA TGTGTAGAAA TACTAGAGCA AGAAGACAAA 4080 

CGAGAACAAA TTCCTAGAGA TGTTCCTCCC TCTTTTGCAG TGGCGGCCGA TGAATTGAAG 4140 . 

ATGACGCTGC TCGATCTTGA AAATAG AGGT ATCT ATCA AG TCTTATTTCT TATATGTTTG 4200 

AATTAAATTT ATGTCCTCTC TATTAGGAAA CTGAGTGAAC TAATG ATAAC TATTCTTTGT 4260 

GTCGTCCACT GTTTAGTTGC ACTTGCTCAA CGTCTTTTTC CA ACGGAAGC ACAAGCTGCA 4320 

ATGGAGATCG CCGAAATGAA GGG A AC ATGT GAGTTCATAG TGACTA GCCT CGAGCCTGAC 4380 

CGTCTCACTG GTACGA AGAG AACATCACCG GGTGTAA AGA TAGCACCTTT CAGAATCCTA 4440 

GAAGAGCATC AAAGTAGACT AAAAGCGCTT TCTAAA ACCG GTATGGATTC TCAC CCACTT 4500 

CATCGGACTC CTTATCACA A AAAACAAAAC TAAATGATCT TTAAACATGG TTTTGTTACT 4560 

TGCTGTCTGA CCTTGTTTTT TTATCATCAG TGGAACTCGG G AAACGATTC TTCCCGCGCT 4620 

GTTCGGCAGT GCTCGACCAG ATTATGAACT GTGAGGACTT GACTCAACTG GCTTGCGGAG 4680 
AAGACGACAC TGCTGAAGAA ACGACTACAA AAGAAGCAAA GGTACATGGA AATACAAGAG 4740 

ACACTAAAG A AGGCCTTTAG TGAGG ACA AT TTGGAATTAG G A A ATTCGTC CCTG ACAGAT 4800 

TCGACTTCTT CCACATCGAA ATCAACCGGT GGAAAGAGGT CTAACCGTAA ACTCTCTCAT 4860 

CGTCGTCGGT GAGACTCTTG CCTCTTAGTG TAATTTTTGC TGTACCATAT AATTCTGTTT 4920 

TCATGATGAC TGTAACTGTT TATGTCTATC GTTGGCGTCA TATAGTTTCG CTCTTCGTTT 4980 

TGC ATCCTGT GTATTATTGC TGC AGGTGTG CTTC AA ACA A ATGTTGTA AC A ATTTG A ACC 5040 

AATGGTATAC AGATTTGTAA TATATATTTA TGTACATCAA CA ATAACCCA TGATGG TGTT 5 1 00 

ACAGAGTTGC TAG AATCAAA GTGTGA AATA ATGTCA AATT GTTCATCTGT TGG ATATTTT 5 1 60 

CCACCAAGA A CCAAAAGAAT ATTCAAGTTC CCTGAACTTC TGGCAACATT CATGTTATAT 5220 

GTATCTTCCT AATTCTTCCT TTAACCTTTT GTAACTCGAA TTACACAGCA AGTTAGTTTC 5280 

AGGTCTAGAG ATAAGAGAAC ACTGAGTGGG CGTGTAAGGT GCATTCTCCT AGTCAGCTCC 5340 
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ATTGCATCCA ACATTTGTGA ATGACACAAG TTA ACAATCC TTTGCACCAT TTCTGGGTGC 5400 

ATACATGGAA A C T I CI ICG A TTGA AACTTC CCACATGTGC AGGTGCGTTC GCTGTCACTG 5460 

ATAGACCAAG AGACTGAAAG CTTTCACA AA TTGCCCTCAA ATCTTCTGTT TCTATCGTCA 5520 

TGACTCCATA TCTCCGACCA CTGGTCATGA GCCAGAGCCC ACTGATTTTG AGGGAATTGG 5580 

GCTA ACCATT TCCGAGCTTC TGAGTCCTTC TTTTTG ATGT CCTTTATGTA GG AATCA A AT 5640 

TCTTCCTTCT GACTTGTGGA TCCAGCCTGC TTCACA AGGC TCACCAGGTT GTAGTCTCCA 5700 

A AA ATATCAT GGAATTGTAA GCAAAAACAA TCCAGACAGA ACCTGTGATA GACCCAAGGT 5760 

TCTTGCCACA GTGATCCGGG TTCGTTAATA ACAGCAACTA TGTCCGGGTG AGGACTGGAG 5820 

ACGAAGCAAA CGTCTTTCCT TTGTGTTACC TTCTCTCTGA TATTAGTGAG AAACCAACGC 5880 

CAACTATCAG TGG ACACTTC TTTGGTAAGC GGAAAGCAAG CGGGAAAAAC AATCA TCAGC 5940 

GTCGAGTCCT GAGGAA AATC ATCA ATTTCA TAGGGGTACT TGCCGTTCAA GTCTTTTGAA 6000 

TCCACTATGA TCAGAGGTCT ACAGTGTTGA AACCCTTCAA TGGACTGTGG AAACGCCCAA 6060 

AACGCGCCAC CGAAGGATGC AAATTCAGGA TTAGGGAAAA GCTCATATTG CAGTCCACAA 6120 

GTAGCCCATT AGATGAGTGA AATGCAGCCA ATTAGTTTAG GCAATACTCT GAAACTCTGA 6180 

TCTTTGATTA CTTCCTGTTC TGCTGCCCGC AGCTTTG AAG TTTTAAGCAT GTCACCAAAC 6240 

TTTTCA ACTC TGCTGTTAGA GTGGGTTGTA CCCTGATCAG ACACTCAATC TCTTCTGCTG 6300 

CAA ATTACA A GTTG A AGTTT TCCGGCTTA A TAG A ACAACA AGTAT GTGG A CCA ACTA CAC 6360 

TTAGTTATCT TAACAAGTCC ATGTTCTTCT ATTCA ATCTG CCCGACGCG A CCA ATTGCAT 6420 

TTCCATCTGA TGCATTTAAA CGTATACTCG TCCTTCTCAA TCTCTTGTAC TACACACTTT 6480 

TGCTGCCCTC TAATGGAACA CCAGTCCACC GCCTTCTTCA GCTCATCCCT ATCTTTAAAA 6540 

CACAACCCTA CACGCAATTC ATGATCATCA ATCCACAAAC TAGACAA AGT ACACTGTTTT 6600 

GAAGCACTCG AATCAACAAC ACCTTTACTT AATA AGCACG CATACGGTAA TACCTCTAAG 6660 

CCTGGCACAT TCA AACCTTG TGTGCATCAT CTG AACCCGA GTTTTTATCC GTTATTTCTC 6720 

CATCCCCACC TCCACGAGTG CTACCATTTC CGAAGTCAGA ATTTTCCTCG TCTTCAATCC 6780 

ACCCGTTACT GTTACCCACT CCCTGAACCT CTAAACCATT ATCTCTCTCT ACTTTCACAG 6840 

ATGCATGTGA CACATAATCA GTAGCTTCTT GGGGTTGTTG CGTCCTCTGT GTATTCGAGG 6900 

AACTAGCGGG ATATTCTATT ACGGATGAAC AAGCAGCATG ATCAGTAACA TTATCAGATG 6%0 

TCGATTTCAC TTCCAA ATAC AACTCCACAT TTCTTATAGA AGGATGATA A CTTGGAACTT 7020 

CAAGCATAGT CTCCAAACTA GTGTCGTTCA CTACATGAAG AAGTAGATAG ATAAAGAGAT 7080 

CCGGTGAAAC AACTACAGGA TACTTACCAA AATATATTGA ACACTGATTT CTGCAGCTGC 7140 

AATCCAA AAA TTGGATAAAG ACCATTCAAC AATGTACTTA ACGCAGTCTT TTGCCTAACC 7200 

TTGACCGTTT TAGG AGTGGA TCCTTCATAG TAA ACACCAT CAGGACCATA CTTGGTAGAA 7260 

CCTTTCTCTC A AGGTTTCCA TCGCCATGAC CATAACAGTC CTGCAGTGAA TTCTAAGAAA 7320 

AATGTAAAAA ATTTTGGCCT AAACTCATAA TTCTTAACAT ACGAAACCAT GGAGAACTCC 7380 

ATGTCTAAAA AATAAAGGCT AAAGCTTTTT GGCGACAGAA GCAGATAAAT CCATTCAAAA 7440 

CACATAAACT CTAAACAATA A ACAGTGATA CTCAATACTA AGACTTGTAA AGGTCTACGT 7500 

AACTCAAAAC TGGAGAATTG TCAGATCGGG TGTGGCTACT AGAAGCTT 7548 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2104 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 93...1871 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
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TCGATCTTTA ACCAAATCCA GTTGATAAGG TCTCTTCGTT GATTAGCAGA GATCTCTTTA 60 
ATTTGTGA AT TTCAATTCAT CGGAACCTGT TG ATG GAC ACC ACC ATT GAT GGA 1 1 3 

Met Asp Thr Thr lie Asp Gly 

1 5 

TTC GCC GAT TCT TAT G A A ATC AGC AGC ACT AGT TTC GTC GCT ACC GAT 1 6 1 

Phe Ala Asp Scr Tyr Glu He Ser Ser Thr Ser Phe Val Ala Thr Asp 
10 15 20 

A AC ACC GAC TCC TCT ATT GTT TAT CTG GCC GCC G A A CA A GTA CTC ACC 209 
Asn Thr Asp Scr Ser lie Val Tyr Leu Ala Ala Glu Gin Val Leu Thr 
25 30 35 

GGA CCT GAT GTA TCT GCT CTG CAA TTG CTC TCC A AC AGC TTC GAA TCC 257 
Gly Pro Asp Val Ser Ala Leu Gin Leu Leu Ser Asn Ser Phe Glu Ser 
40 45 50 55 

GTC TTT GAC TCG CCG GAT GAT TTC TAC AGC GAC GCT A AG CTT GTT CTC 305 
Val Phe Asp Scr Pro Asp Asp Phe Tyr Ser Asp Ala Lys Leu Val Leu 
60 65 70 

TCC GAC GGC CGG GAA GTT TCT TTC CAC CGG TGC GTT TTG TCA GCG AGA 353 
Scr Asp Gly Arg Glu Val Ser Phe His Arg Cys Val Leu Ser Ala Arg 
75 80 85 

AGC TCT TTC TTC A AG AGC GCT TTA GCC GCC GCT AAG AAG GAG AAA GAC 401 
Scr Scr Phe Phe Lys Ser Ala Leu Ala Ala Ala Lys Lys Glu Lys Asp 
90 95 100 

TCC AAC AAC ACC GCC GCC GTG AAG CTC GAG CTT AAG GAG ATT GCC AAG 449 
Ser Asn Asn Thr Ala Ala Vat Lys Leu Glu Leu Lys Glu lie Ala Lys 
105 110 115 

GAT TAC GAA GTC GGT TTC GAT TCG GTT GTG ACT GTT TTG GCT TAT GTT 497 
Asp Tyr Glu Val Gly Phe Asp Ser Val Vai Thr Val Leu Ala Tyr Val 
120 125 130 135 

TAC AGC AGC AGA GTG AGA CCG CCG CCT AAA GGA GTT TCT GAA TGC GCA 545 
Tyr Ser Scr Arg Val Arg Pro Pro Pro Lys Gly Val Ser Glu Cys Ala 
140 145 150 

GAC GAG AAT TGC TGC CAC GTG GCT TGC CGG CCG GCG GTG GAT TTC ATG 593 
Asp Glu Asn Cys Cys His Val Ala Cys Arg Pro Ala Val Asp Phe Met 
155 160 165 

TTG GAG GTT CTC TAT TTG GCT TTC ATC TTC AAG ATC CCT GAA TTA ATT 64 1 

Leu Glu Val Leu Tyr Leu Ala Phe He Phe Lys lie Pro Glu Leu lie 
170 175 180 

ACT CTC TAT CAG AGG CAC TTA TTG GAC GTT GTA GAC AAA GTT GTT ATA 689 
Thr Leu Tyr Gin Arg His Leu Leu Asp Val Val Asp Lys Val Val lie 
185 190 195 

GAG GAC ACA TTG GTT ATA CTC AAG CTT GCT AAT ATA TGT GGT AAA GCT 737 
Glu Asp Thr Leu Val He Leu Lys Leu Ala Asn He Cys Gly Lys Ala 
200 205 210 215 
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TGT ATG AAG CTA TTG GAT AGA TGT AAA GAG ATT ATT GTC A AG TCT AAT 785 
Cys Met Lys Leu Leu Asp Arg Cys Lys Glu lie He Val Lys Ser Asn 
220 225 230 

GTA GAT ATG GTT AGT CTT GA A AAG TCA TTG CCG GAA GAG CTT GTT AAA 833 
Val Asp Met Val Ser Leu Glu Lys Scr Leu Pro Glu Glu Leu Val Lys 
235 240 245 

GAG ATA ATT GAT AGA CGT AAA GAG CTT GGT TTG GAG GTA CCT AAA GTA 88 1 

Glu He lie Asp Arg Arg Lys Glu Leu Gly Leu Glu Val Pro Lys Val 
250 255 260 

AAG AAA CAT GTC TCG AAT GTA CAT AAG GCA CTT G AC TCG GAT GAT ATT 929 
Lys Lys His Val Ser Asn Val His Lys Ala Leu Asp Ser Asp Asp lie 
265 270 275 

GAG TTA GTC AAG TTG CTT TTG AAA GAG GAT CAC ACC AAT CTA GAT GAT 977 
Glu Leu Val Lys Leu Leu Leu Lys Glu Asp His Thr Asn Leu Asp Asp 
280 285 290 295 

GCG TGT GCT CTT CAT TTC GCT GTT GCA TAT TGC AAT GTG AAG ACC GCA 1 025 

Ala Cys Ala Leu His Phe Ala Val Ala Tyr Cys Asn Val Lys Thr Ala 
300 305 310 

ACA GAT CTT TTA AAA CTT GAT CTT GCC GAT GTC A AC CAT AGG AAT CCG 1073 
Thr Asp Leu Leu Lys Leu Asp Leu Ala Asp Val Asn His Arg Asn Pro 
315 320 325 

AGG GGA TAT ACG GTG CTT CAT GTT GCT GCG ATG CGG AAG GAG CCA CAA 1121 
Arg Gly Tyr Thr Val Leu His Val Ala Ala Mel Arg Lys Glu Pro Gin 
330 335 340 

TTG ATA CTA TCT CTA TTG GAA AAA GGT GCA AGT GCA TCA GAA GCA ACT 1 ! 69 

Leu lie Leu Ser Leu Leu Glu Lys Gly Ala Ser Ala Scr Glu Ala Thr 
345 350 355 

TTG GAA GGT AGA ACC GCA CTC ATG ATC GCA AAA CAA GCC ACT ATG GCG 1217 
Leu Glu Gly Arg Thr Ala Leu Met lie Ala Lys Gin Ala Thr Met Ala 
360 365 370 375 

GTT GAA TGT AAT AAT ATC CCG GAG CAA TGC AAG CAT TCT CTC AAA GGC 1265 
Val Glu Cys Asn Asn lie Pro Glu Gin Cys Lys His Ser Leu Lys Gly 
380 385 390 

CG A CTA TGT GTA GAA ATA CTA GAG CAA GAA GAC AAA CG A GAA CAA ATT 1313 
Arg Leu Cys Val Glu He Leu Glu Gin Glu Asp Lys Arg Glu Gin He 
395 400 405 

CCT AGA GAT GTT CCT CCC TCT TTT GCA GTG GCG GCC GAT GAA TTG AAG ! 36 1 

Pro Arg Asp Val Pro Pro Ser Phe Ala Val Ala Ala Asp Glu Leu Lys 
410 415 420 

ATG ACG CTG CTC GAT CTT GAA AAT AGA GTT GCA CTT GCT CAA CGT CTT 1409 
Met Thr Leu Leu Asp Leu Glu Asn Arg Val Ala Leu Ala Gin Arg Leu 
425 430 435 

TTT CCA ACG GAA GCA CAA GCT GCA ATG GAG ATC GCC GAA ATG AAG GGA 1 457 

Phe Pro Thr Glu Ala Gin Ala Ala Met Glu He Ala Glu Met Lys Gly 
440 445 450 455 
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ACA TGT GAG TTC ATA GTG ACT AGC CTC GAG CCT G AC CGT CTC ACT GGT 1 505 

Thr Cys GIu Phe lie Val Thr Ser Leu Glu Pro Asp Arg Leu Thr Gly 
460 465 470 

ACG A AG AGA AC A TCA CCG GGT GTA AAG ATA GCA CCT TTC AG A ATC CTA 1 553 

Thr Lys Arg Thr Ser Pro Gly Val Lys lie Ala Pro Phe Arg lie Leu 
475 480 485 

GAA GAG CAT CAA AGT AGA CTA AAA GCG CTT TCT AAA ACC GTG GA A CTC 1601 
Glu Glu His Gin Ser Arg Leu Lys Ala Leu Scr Lys Thr Val Glu Leu 
490 495 500 

GGG AAA CG A TTC TTC CCG CGC TGT TCG GCA GTG CTC G AC CAG ATP ATG 1 649 

Gly Lys Arg Phe Phe Pro Arg Cys Scr Ala Val Leu Asp Gin lie Met 
505 510 515 

A AC TGT GAG GAC TTG ACT CAA CTG GCT TGC GGA GAA GAC GAC ACT GCT 1697 
Asn Cys Glu Asp Leu Thr Gin Leu Ala Cys Gly Glu Asp Asp Thr Ala 
520 525 530 535 

GAG AAA CGA CTA CAA AAG AAG CAA AGG TAC ATG GAA ATA CAA GAG ACA 1745 
Glu Lys Arg Leu Gin Lys Lys Gin Arg Tyr Met Glu lie Gin Glu Thr 
540 545 550 

CTA AAG AAG GCC TTT AGT GAG GAC AAT TTG GAA TTA GGA A AT TCG TCC 1 793 

Leu Lys Lys Ala Phe Scr Glu Asp Asn Leu Glu Leu Gly Asn Ser Scr 
555 560 565 

CTG ACA GAT TCG ACT TCT TCC ACA TCG AAA TCA ACC GGT GGA AAG AGG 1841 
Leu Thr Asp Ser Thr Ser Ser Thr Ser Lys Scr Thr Gly Gly Lys Arg 
570 575 580 

TCT A AC CGT AAA CTC TCT CAT CGT CGT CGG TG AG ACTCTT GCCTCTTAGT GTA 1 894 
Ser Asn Arg Lys Leu Ser His Arg Arg Arg 
585 590 

ATTTTTGCTG TACCATATAA TTCTGTTTTC ATGATGACTG TAACTGTTTA TGTCTATCGT 1954 

TGGCGTC ATA TAGTTTCGCT CTTCGTTTTG CATCCTGTGT ATTATTGCTG CAGGTGTGCT 20 1 4 

TCAAACAAAT GTTGTA ACAA TTTG AACCAA TGGTATACAG ATTTGTAATA TATATTTATG 2074 

TACATC A ACA AT AA AAA AAA A AAA AAA AAA 2 1 04 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Met Asp Thr Thr lie Asp Gly Phe Ala Asp Ser Tyr Glu He Ser Ser 
1 5 10 15 
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Thr Scr Phc Va) Ala Thr Asp Asn Thr Asp Scr Ser lie Val Tyr Leu 

20 25 30 

Ala Ala Glu Gin Val Leu Thr Gly Pro Asp Va] Ser Ala Leu Gin Leu 

35 40 45 

Leu Scr Asn Ser Phc Glu Scr Val Phe Asp Ser Pro Asp Asp Phe Tyr 

50 55 60 

Ser Asp Ala Lys Leu Val Leu Ser Asp Gly Arg Glu Val Ser Phe His 
65 70 75 80 

Arg Cys Val Leu Scr Ala Arg Ser Ser Phe Phe Lys Scr Ala Leu Ala 

85 90 95 

Ala Ala Lys Lys Glu Lys Asp Scr Asn Asn Thr Ala Ala Val Lys Leu 

100 105 110 

Glu Leu Lys Glu lie Ala Lys Asp Tyr Glu Val Gly Phc Asp Ser Val 

115 120 125 

Val Thr Val Leu Ala Tyr Val Tyr Ser Scr Arg Val Arg Pro Pro Pro 

130 135 140 

Lys Gly Val Ser Glu Cys Ala Asp Glu Asn Cys Cys His Val Ala Cys 
145 150 155 160 

Arg Pro Ala Val Asp Phe Met Leu Glu Va! Leu Tyr Leu Ala Phc lie 

165 170 175 

Phc Lys lie Pro Glu Leu lie Thr Leu Tyr Gin Arg His Leu Leu Asp 

180 185 190 

Val Val Asp Lys Val Val He Glu Asp Thr Leu Val lie Leu Lys Leu 

195 200 205 

Ala Asn He Cys Gly Lys Ala Cys Met Lys Leu Leu Asp Arg Cys Lys 

210 215 220 

Glu He He Val Lys Ser Asn Val Asp Met Val Ser Leu Glu Lys Ser 
225 230 235 240 

Leu Pro Glu Glu Leu Val Lys Glu He lie Asp Arg Arg Lys Glu Leu 

245 250 255 

Gly Leu Glu Val Pro Lys Val Lys Lys His Val Ser Asn Val His Lys 

260 265 270 

Ala Leu Asp Scr Asp Asp lie Glu Leu Val Lys Leu Leu Leu Lys Glu 

275 280 285 

Asp His Thr Asn Leu Asp Asp Ala Cys Ala Leu His Phe Ala Val Ala 

290 295 300 

Tyr Cys Asn Val Lys Thr Ala Thr Asp Leu Leu Lys Leu Asp Leu Ala 
305 310 315 320 

Asp Val Asn His Arg Asn Pro Arg Gly Tyr Thr Val Leu His Val Ala 

325 330 335 

Ala Met Arg Lys Glu Pro Gin Leu lie Leu Ser Leu Leu Glu Lys Gly 

340 345 350 

Ala Ser Ala Ser Glu Ala Thr Leu Glu Gly Arg Thr Ala Leu Met He 

355 360 365 

Ala Lys Gin Ala Thr Met Ala Val Glu Cys Asn Asn He Pro Glu Gin 

370 375 380 

Cys Lys His Ser Leu Lys Gly Arg Leu Cys Val Glu lie Leu Glu Gin 
385 390 395 400 

Glu Asp Lys Arg Glu Gin He Pro Arg Asp Val Pro Pro Ser Phe Ala 

405 410 415 

Val Ala Ala Asp Glu Leu Lys Met Thr Leu Leu Asp Leu Glu Asn Arg 

420 425 430 

Val Ala Leu Ala Gin Arg Leu Phe Pro Thr Glu Ala Gin Ala Ala Met 

435 440 445 

Glu lie Ala Glu Met Lys Gly Thr Cys Glu Phe He Val Thr Ser Leu 

450 455 460 

G)u Pro Asp Arg Leu Thr Gly Thr Lys Arg Thr Ser Pro Gly Val Lys 
465 470 475 480 

He Ala Pro Phc Arg He Leu Glu Glu His Gin Ser Arg Leu Lys Ala 
485 490 495 
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Leu Ser Lys Thr Val Glu Leu Gly Lys Arg Phc Phe Pro Arg Cys Ser 

500 505 510 

Ala Val Leu Asp Gin lie Met Asn Cys Glu Asp Leu Thr Gin Leu Ala 

515 520 525 

Cvs Gly Glu Asp Asp Thr Ala Glu Lys Arg Leu Gin Lys Lys Gin Aig 

530 535 540 

Tyr Met Glu He Gin Glu Thr Leu Lys Lys Ala Phc Ser Glu Asp Asn 
545 550 555 560 

Leu Glu Leu Gly Asn Ser Ser Leu Thr Asp Ser Thr Ser Ser Thr Ser 

565 570 575 

Lys Ser Thr Gly Gly Lys Arg Ser Asn Arg Lys Leu Ser flis Arg Arg 
580 585 590 

Arg 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Asn His Arg Asn Pro Arg Gly Tyr Thr Val Leu His Val Ala Ala Met 
15 10 15 

Arg Lys Glu Pro Gin Leu He Leu Ser Leu Leu Glu Lys Gly Ala Ser 

20 25 30 

Ala Ser Glu Ala Thr Leu Glu Gly Arg Thr Ala Leu Met He Ala Lys 
35 40 45 

Gin 



(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS. 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
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Asn Ala Lys Thr Lys Asn Gly Tyr Thr Ala Leu His Gin Ala Ala Gin 
15 10 15 

Gin Gly His Thr His He lie Asn Val Leu Leu Gin Asn Asn Ala Ser 

20 25 30 

Pro Asn Glu Leu Thr Val Asn Gly Asn Thr Ala Leu Ala lie Ala Arg 
35 40 45 

Arg 



(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Lys Val Lys Lys His Val Ser Asn Val His Lys Ala Leu Asp Ser Asp 
15 10 15 

Asp He Glu Leu Val Lys Leu Leu Leu Lys Glu Asp 
20 25 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Lys Thr Lys Asn Gly Leu Ser Pro Leu His Met Ala Thr Gin Gly Asp 
15 10 15 

His Leu Asn Cys Val Gin Leu Leu Leu Ser Arg Asn 
20 25 

(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Lys His Val Ser Asn Val His Lys Ala Leu Asp Ser Asp Asp He GIu 
15 10 15 

Leu Val Lys Leu Leu Leu Lys Glu Asp His Thr Asn Leu Asp Asp Ala 
20 25 30 

Cys 



(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Asp Asp Ala Cys Ala Leu His Phe Ala Val Ala Tyr Cys Asn Val Lys 
1 5 10 15 

Thr Ala Thr Asp Leu Leu Lys Leu Asp Leu Ala Asp Val Asn His Arg 
20 25 30 

Asn 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

Arg Gly Tyr Thr Val Leu His Val Ala Ala Met Arg Lys Glu Pro Gin 
15 10 15 
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Leu lie Leu Ser Leu Leu Glu Lys Gly Ala Scr Ala Ser Glu Ala Thr 
20 25 30 

Leu 



(2) INFORMATION FOR SEQ ID NO: 1 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 1 : 

Glu Gly Arg Thr Ala Leu Met lie Ala Lys Gin Ala Thr Met Ala Val 
15 10 15 

Glu Cys Asn Asn He Pro Glu Gin Cys Lys His Ser Leu Lys Gly Arg 
20 25 30 

Leu 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION. SEQ ID NO: 12: 

Gly Thr Pro Leu His Leu Ala Ala Arg Gly His Val Glu Val Val Lys 
15 10 15 

Leu Leu Leu Asp Gly Ala Asp Val Asn Ala Thr Lys Ala He Ser Gin 

20 25 30 

Asn Asn Leu Asp He Ala Glu Val Lys Asn Pro Asp Asp Val Lys Thr 

35 40 45 

Met Arg Gin Ser He Asn Glu 
50 55 

(2) INFORMATION FOR SEQ ID NO: 13: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

GTGACTTTCT A ACTATGGCT GA AATTGCAG AACGAAAAAG ACTTTCCATT TTTCACTTG A 60 

ATGA A ACCCA A A ATGG A AAT CTATCTCTCT TCTTCTTCTC TTTTA CTACC TCCA TTTCC A 1 20 

TGGCTTTCCC TCCTCTACCT TCCCTAGCTC TTTTCAATTT CTAGA ATATT CTTTTCTTAG 1 80 

TCTGTAATTA TCTATAGCTC AATTTCTAAG ACAGAACTTA TGTAAGGCGG CTTTCTGTAA 240 

TGGATA ATAG TAGGACTGCG TTTTCTGATT CGAATGACAT CAGCGG A AGC AGTAGTATAT 300 

GCTGCATCGG CGGCGGCATG ACTGAATTTT TCTCGCCGGA GACTTCGCCG GCGGAGATCA 360 

CTTCACTG AA ACGCCTATCG GAA ACACTGG A ATCTATCTT CG ATGCGTCT TTGCCGG AGT 420 

TTGACTACTT CGCCGACGCT AAGCTTGTGG TTTCCGGCCC GTGTAAGGAA ATTCCGGTGC: 480 

ACCGGTGCAT TTTGTCGGCG AGG AGTCCGT TCTTTAAGAA TTTGTTCTGC GGTA A A A AGG 540 

AGAAGAATAG TAGTAAGGTG GAATTGAAGG AGGTGATGAA AGAGCATGAG GTGAGCTATG 600 

ATGCTGTAAT GAGTGTATTG GCTTATTTGT ATAGTGGTAA AGTTAGGCCT TCACCTAAAG 660 

ATGTGTGTGT TTGTGTGGAC AATGACTGCT CTCATGTGGC TTGTAGGCCA GCTGTGGCAT 720 

TCCTGGTTGA GGTTTTGTAC ACATCATTTA CCTTTCAGAT CTCTGAATTG GTTGACAAGT 780 

TTCAGAGACA CCTACTGGAT ATTCTTGACA AAACTGCAGC AGACGATGTA ATGATGGTTT 840 

TATCTGTTGC AAACATTTGT GGTAAAGCAT GCGAGAGATT GCTTTCAAGC TGCATTGAGA 900 

TTATTGTCAA GTCTAATGTT GATATCATAA CCCTTGATA A AGCCTTGCCT CATG ACATTG 960 

TAA AACAAAT TACTGATTCA CGAGCGGAAC TTGGTCTACA AGGGCCTGAA AGCAACGGTT 1020 

TTCCTG ATAA ACATGTTAAG AGGATACATA GGGCATTGGA TTCTGATGAT GTTGAATTAC 1080 

TACAA ATGTT GCTAAGAGAG GGGCATACTA CCCTAGATGA TGCATATGCT CTCCATTATG 1 140 

CTGTAGCGTA TTGCG ATGCA AAGACTACAG CAGA ACTTCT AG ATCTTGCA CTTGCTGATA 1 200 

TTAATCATCA AAATTCA AGG GG ATACACGG TGCTGCATGT TGCAGCCATG AGGA A AG AGC 1 260 

CTAAA ATTGT AGTGTCCCTT TTAACCAA AG GAGCTAGACC TTCTG ATCTG ACATCCGATG 1 320 

G AAG AA A AGC ACTTCA A ATC GCCA AG AGGC TCACTAGGCT TGTGG ATTTC AGTA AGTCTC 1 380 

CGG AGGA AGG A AAATCTGCT TCGAATG ATC GGTTATGCAT TGAG ATTCTG GAGCAAGCAG 1440 

A AAG A AG AGA CCCTCTGCTA GGAG A AGCTT CTGTATCTCT TGCTATGGCA GGCGAT GATT 1 500 

TGCGTATGAA GCTGTTATAC CTTGA AA ATA GAGTTGGCCT GGCTA AACTC CTTTTTCCA A 1 560 

TGG AAGCTA A AGTTGCAATG G ACATTGCTC AAGTTG ATGG CACTTCTG AG TTCCCACTGG 1 620 

CTAGCATCGG CAAAAAGATG GCTAATGCAC AGAGGACAAC AGTAGATTTG AACGAGGCTC 1680 

CTTTCAAG AT A AAAG AGG AG CACTTGA ATC GGCTTAGAGC ACTCTCTAGA ACTGTAGA AC 1 740 

TTGG A A A ACG CTTCTTTCCA CGTTGTTCAG AAGTTCTAAA TAAG ATCATG G ATGCTGATG 1 800 

ACTTGTCTG A G ATAGCTTAC ATGGGGAATG ATACGGCAGA AGAGCGTCAA CTG AAGA AGC 1 860 

AAAGGTACAT GG A ACTTCA A G AAATTCTGA CTAAAGCATT CACTG AGGAT A A AG A AGA AT 1 920 

ATGATAAGAC TAACAACATC TCCTCATCTT GTTCCTCTAC ATCTA AGGGA GTAGATAAGC 1980 

CCAATAAGCT CCCTTTTAGG AAATAGGTAA TTGTATTAGG ATATATGAGC A AG A AG AGG A 2040 

TTTTCTTGTA ACATAGCACT CTTTCCTTTC ATCATTTGAT ATGTCAACAT ACATACAACA 2 100 

GCTGTACCAT AA ACTTGTAT TGTTGCACTT ACAACTTTGA AG AACAG AAT TTATTTGAAA 2 160 

AA AA A AA AA A AA 2 1 72 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 588 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Asp Asn Ser Arg Thr Ala Phe Ser Asp Ser Asn Asp He Ser Gly 
15 10 15 

Ser Ser Ser He Cys Cys He Gly Gly Gly Met Thr Glu Phe Phe Ser 

20 25 30 

Pro Glu Thr Ser Pro Ala Glu He Thr Ser Leu Lys Arg Leu Ser Glu 

35 40 45 

Thr Leu Glu Ser He Phe Asp Ala Ser Leu Pro Glu Phe Asp Tyr Phe 

50 55 60 

Ala Asp Ala Lys Leu Val Val Ser Gly Pro Cys Lys Glu He Pro Val 
65 70 75 80 

His Arg Cys He Leu Ser Ala Arg Ser Pro Phe Phe Lys Asn Leu Phe 

85 90 95 

Cys Gly Lys Lys Glu Lys Asn Ser Ser Lys Val Glu Leu Lys Glu Val 

100 105 110 

Met Lys Glu His Glu Val Ser Tyr Asp Ala Val Met Ser Val Leu Ala 

115 120 125 

Tyr Leu Tyr Ser Gly Lys Val Arg Pro Ser Pro Lys Asp Val Cys Val 

130 135 140 

Cys Val Asp Asn Asp Cys Ser His Val Ala Cys Arg Pro Ala Val Ala 
145 150 155 160 

Phe Leu Val Glu Val Leu Tyr Thr Ser Phe Thr Phe Gin He Ser Glu 

165 170 175 

Leu Val Asp Lys Phe Gin Arg His Leu Leu Asp He Leu Asp Lys Thr 

180 185 190 

Ala Ala Asp Asp Val Met Met Val Leu Ser Val Ala Asn He Cys Gly 

195 200 205 

Lys Ala Cys Glu Arg Leu Leu Ser Ser Cys He Glu lie He Val Lys 

210 215 220 

Ser Asn Val Asp He lie Thr Leu Asp Lys Ala Leu Pro His Asp He 
225 230 235 240 

Val Lys Gin He Thr Asp Ser Arg Ala Glu Leu Gly Leu Gin Gly Pro 

245 . 250 255 

Glu Ser Asn Gly Phe Pro Asp Lys His Val Lys Arg He His Arg Ala 

260 265 270 . 

Leu Asp Ser Asp Asp Val Glu Leu Leu Gin Met Leu Leu Arg Glu Gly 

275 280 285 

His Thr Thr Leu Asp Asp Ala Tyr Ala Leu His Tyr Ala Val Ala Tyr 
290 295 300 



WO 98/06748 



PCTWS97/13994 



-78- 

Cys Asp Ala Lys Thr Thr Ala Glu Leu Leu Asp Leu Ala Leu Ala Asp 

305 310 315 320 

lie Asn His Gin Asn Ser Arg Gly Tyr Thr Val Leu His Val Ala Ala 

325 330 335 

Met Arg Lys Glu Pro Lys He Val Val Ser Leu Leu Thr Lys Gly Ala 

340 345 350 

Arg Pro Scr Asp Leu Thr Ser Asp Gly Arg Lys Ala Leu Gin He Ala 

355 360 365 

Lys Arg Leu Thr Arg Leu Val Asp Phe Ser Lys Ser Pro Glu Glu Gly 

370 375 380 

Lys Ser Ala Ser Asn Asp Arg Leu Cys lie Glu He Leu Glu Gin Ala 
385 390 395 400 

Glu Arg Arg Asp Pro Leu Leu Gly Glu Ala Ser Val Ser Leu Ala Met 

405 410 415 

Ala Gly Asp Asp Leu Arg Met Lys Leu Leu Tyr Leu Glu Asn Arg Val 

420 425 430 

Gly Leu Ala Lys Leu Leu Phe Pro Met Glu Ala Lys Val Ala Met Asp 

435 440 445 

He Ala Gin Val Asp Gly Thr Ser Glu Phe Pro Leu Ala Ser lie Gly 

450 455 460 

Lys Lys Met Ala Asn Ala Gin Arg Thr Thr Val Asp Leu Asn Glu Ala 
465 470 475 480 

Pro Phe Lys He Lys Glu Glu His Leu Asn Arg Leu Arg Ala Leu Ser 

485 490 495 

Arg Thr Val Glu Leu Gly Lys Arg Phe Phe Pro Arg Cys Ser Glu Val 

500 505 510 

Leu Asn Lys He Met Asp Ala Asp Asp Leu Ser Glu He Ala Tyr Met 

515 520 525 

Gly Asn Asp Thr Ala Glu Glu Arg Gin Leu Lys Lys Gin Arg Tyr Met 

530 535 540 

Glu Leu Gin Glu He Leu Thr Lys Ala Phe Thr Glu Asp Lys Glu Glu 
545 550 555 560 

Tyr Asp Lys Thr Asn Asn lie Ser Ser Ser Cys Ser Ser Thr Ser Lys 

565 570 575 

Gly Val Asp Lys Pro Asn Lys Leu Pro Phe Arg Lys 
580 585 

(2) INFORMATION FOR SEQ ID NO:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GTG ACAGACT TGCTCCTACT G 2 1 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CAGTGTGTAT CAAAGCACCA 20 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TTCTCCAGAC CACATGATTA T 21 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



WO 98/06748 



PCT/US97/13994 



-80- 

TGAAGCTAAT ATGCACAGGA G 2 1 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GTAGGTGCTC TTGTTCTTCC C 2 1 

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE:DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
C AC ATAATTC CCACGAGGAT C 2 1 

(2) INFORMATION FOR SEQ ID NO:21 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21 : 

Met Lys Gly Thr Cys Glu Phe He Val Thr Ser Leu Glu Pro Asp Arg 
15 10 15 

Leu 
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(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Arg Arg Lys Glu Leu Gly Leu Glu Val Pro Lys Val Lys Lys 
1 5 10 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Lys Lys Gin Arg Tyr Met Glu He Gin Glu Thr Leu Lys Lys 
1 5 10 

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
AARGARGAYC AYACNAA 
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(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
TAYGTYAAYG TNAARAC 17 
(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
GCCATNGTNG CYTGYTT 1 7 

(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
AARGTNAARA ARCAYGT 17 
(2) INFORMATION FOR SEQ ID NO:28: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 



RAAYTCRCAN GTNCCYTTCA T 



21 
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We claim: 
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Claims 

1 . An isolated nucleic acid molecule comprising a sequence encoding an acquired 
resistance polypeptide, wherein said acquired resistance polypeptide is capable of conferring, 
on a plant expressing said polypeptide, resistance to a plant pathogen. 

5 

2. The isolated nucleic acid molecule of claim 1 , wherein said polypeptide is capable 
of mediating the expression of a pathogenesis-related polypeptide. 

3. The isolated nucleic acid molecule of claim 1, wherein said polypeptide comprises 
1 0 an ankyrin-repeat motif. 

4. The isolated nucleic acid molecule of claim 1, wherein said polypeptide is obtained 
from an angiosperm. 

15 5. The isolated nucleic acid molecule of claim 4, wherein said angiosperm is a 

member of the Solanaceae or the Cruciferae. 

6. The isolated nucleic acid molecule of claim 1 , wherein said nucleic acid molecule 
is genomic DNA or cDNA. 

20 

7. The isolated nucleic acid molecule of claim 1, wherein said plant pathogen is a 
bacterium, virus, viroid, fungus, nematode, or insect. 

8. An isolated nucleic acid molecule that encodes an acquired resistance polypeptide 
25 that specifically hybridizes to a nucleic acid molecule comprising the genomic nucleic acid 

sequence of Fig. 4 (SEQ ID NO:l). 



30 



9. An isolated nucleic acid molecule that encodes an acquired resistance polypeptide 
that specifically hybridizes to a nucleic acid molecule comprising the cDNA of Fig. 5 (SEQ 
ID NO:2). 
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10. An isolated nucleic acid molecule that encodes an acquired resistance polypeptide 
that specifically hybridizes to a nucleic acid molecule comprising CDNA sequence of Fig. 7 A 
(SEQ ID NO: 13). 

5 11. The isolated nucleic acid molecule of claims 8-10, wherein said nucleic acid 

molecule encodes a polypeptide that mediates the expression of a pathogenesis-related 
polypeptide. 

12. The isolated nucleic acid molecule of claims 8-10, wherein said nucleic acid 
1 0 molecule encodes a polypeptide comprising an ankyrin-repeat motif. 

13. The isolated nucleic acid molecule of claims 1 or 8-10, wherein said nucleic acid 
molecule is operably linked to an expression control region. 

15 14. A vector comprising the nucleic acid molecule of claims 1 or 8-10, said vector 

being capable of directing expression of the polypeptide encoded by said nucleic acid 
molecule. 

15. A cell comprising an isolated nucleic acid molecule of claims 1, 8-10, or 14. 

20 

16. The cell of claim 15, wherein said cell is a plant cell. 

1 7. The cell of claim 15, wherein said cell is a bacterial cell. 

25 1 8. The cell of claim 17, wherein said bacterial cell is Agrobacterium. 

19. The cell of claim 1 6, wherein said plant cell has increased resistance to a plant 
pathogen. 



30 



20. A transgenic plant comprising a nucleic acid molecule of claim 1 , 8-10, or 14, 
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wherein said nucleic acid molecule is expressed in said transgenic plant. 

21. The transgenic plant of claim 20, wherein said transgenic plant is an angiosperm. 

5 22. The transgenic plant of claim 20, wherein said transgenic angiosperm is a 

monocot or a dicot. 

23. The transgenic plant of claim 20, wherein said dicot is a cruciferous plant or a 
solanaceous plant. 

10 

24. A seed from a transgenic plant of claim 20. 

25. A cell from a transgenic plant of claim 20. 

15 26. A substantially pure acquired resistance polypeptide including an amino acid 

sequence that has at least 40% identity to the amino acid sequence of Fig. 5 (SEQ ID NO:3) 
or Fig. 7B (SEQ ID NO: 14). 

27. The substantially pure polypeptide of claim 26, wherein said polypeptide is 
20 capable of mediating the expression of a pathogenesis-related polypeptide. 

28. The substantially pure polypeptide of claim 26, wherein said polypeptide includes 
an ankyrin-repeat motif or a G-protein coupled receptor motif. 

25 29. The substantially pure polypeptide of claim 26, wherein said polypeptide is 

obtained from an angiosperm. 

30. The substantially pure polypeptide of claim 29, wherein said angiosperm is a 
member of the Solanaceae or the Cruciferae. 
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3 1 . A method of producing an acquired resistance polypeptide, said method 

comprising the steps of: 

(a) providing a cell transformed with a nucleic acid molecule of claims 1 or 
8-10 positioned for expression in the cell; 

5 (b) culturing the transformed cell under conditions for expressing the nucleic acid 

molecule; and 

(c) recovering the acquired resistance polypeptide. 

32. A recombinant acquired resistance polypeptide produced by the method of claim 

10 31. 

33. A substantially pure antibody that specifically recognizes and binds to an 
acquired resistance polypeptide or a portion thereof. 

1 5 34. The substantially pure antibody of claim 33, wherein said antibody recognizes and 

binds to a recombinant acquired resistance polypeptide or a portion thereof. 

35. A method of providing an increased level of resistance against a disease caused 
by a plant pathogen in a transgenic plant, said method comprising the steps of: 

20 (a) producing a transgenic plant cell including the nucleic acid molecule of claims 1 

or 8-10, wherein said nucleic acid is positioned for expression in the plant cell; and 

(b) growing a transgenic plant from the plant cell wherein the nucleic acid molecule is 
expressed in the transgenic plant and the transgenic plant is thereby provided with an 
increased level of resistance against a disease caused by a plant pathogen. 

25 

36. The method of claim 35, wherein said plant pathogen is a bacterium, virus, viroid, 
fungus, nematode, or insect. 

37. The method of claim 35, wherein said plant pathogen is Phytophthora, 
30 Peronospora, or Pseudomonas. 
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38. A method of isolating an acquired resistance gene or fragment thereof, said 
method comprising the steps of: 

(a) contacting the nucleic acid molecule of Fig. 4 (SEQ ID NO:l), Fig. 5 (SEQ ID 
NO:2), or Fig. 7A (SEQ ID NO: 1 3) or a portion thereof with a preparation of DNA from a 
plant cell under hybridization conditions providing detection of DNA sequences having at 
least 40% or greater sequence identity to the nucleic acid sequence of Fig. 4 (SEQ ID NO:l), 
Fig. 5 (SEQ ID NO:2), or Fig. 7A (SEQ ID NO: 13); and 

(b) isolating said hybridizing DNA. 

39. A method ofisolating an acquired resistance gene or fragment thereof, said 
method comprising the steps of: 

(a) providing a sample of plant cell DNA; 

(b) providing a pair of oligonucleotides having sequence identity to a region of the 
nucleic acid of Fig. 4 (SEQ ID NO:l), Fig. 5 (SEQ ID NO:2), or Fig. 7 A (SEQ ID NO: 13); 

(c) contacting the pair of oligonucleotides with said plant cell DNA under conditions 
suitable for polymerase chain reaction-mediated DNA amplification; and 

(d) isolating the amplified acquired resistance gene or fragment thereof. 

40. The method of claim 39, wherein said amplification step is carried out using a 
sample of cDNA prepared from a plant cell. 

41 . The method of claim 39, wherein said pair of oligonucleotides are based on a 
sequence encoding an acquired resistance polypeptide, wherein the acquired resistance 
polypeptide is at least 40% identical to the amino acid sequence of Fig. 5 (SEQ ID NO:3) or 
Fig. 7B (SEQ ID NO: 14). 
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10 20 30 40 50 

***** 

AAGCTTGTGA TGCAAGTCAT GGGATATTGC TTTGTGTTAA GTATACAAAA 
TTCGAACACT ACGTTCAGTA CCCTATAACG AAACACAATT CATATGTTTT 



60 70 80 90 100 

***** 

CCATCACGTG GATACATAGT CTTCAAACCA ACCACTAAAC AGTATCAGGT 
GGTAGTGCAC CTATGTATCA GAAGTTTGGT TGGTGATTTG TCATAGTCCA 

110 120 130 140 150 

***** 

CATACCAAAG CCAGAAGTGA AGGGTTGGGA TATGTCATTG GGTTTAGCGG 
GTATGGTTTC GGTCTTCACT TCCCAACCCT ATACAGTAAC CCAAATCGCC 

160 170 180 190 200 

***** 

TAATCGGATT GAACCCTTTC CGGTATAAAA TACAAAGGCT TTCGCAGTCT 
ATTAGCCTAA CTTGGGAAAG GCCATATTTT ATGTTTCCGA AAGCGTCAGA 



210 220 230 240 250 

***** 

CGGCGTATGT GTATGTCTCG GGGTATCTAC CATTTGAATC ACAGAACTTT 
GCCGCATACA CATACAGAGC CCCATAGATG GTAAACTTAG TGTCTTGAAA 



260 270 280 290 300 

***** 

TATGTGCGAA GTTTTCGATT CTGATTCGTT TACCTGGAAG AGATTAGAAA 
ATACACGCTT CAAAAGCTAA GACTAAGCAA ATGGACCTTC TCTAATCTTT 

310 320 330 340 350 

***** 

TTTGCGTCTA CCAAAAACAG ACAGATTAAT TTTTTCCAAC CCGATACAAG 
AAACGCAGAT GGTTTTTGTC TGTCTAATTA AAAAAGGTTG GGCTATGTTC 

360 370 380 390 400 

***** 

TTTCGGGGTT CTTGCATTGG ATATCACGGA ACAACAATGT GATCCGGTTT 
AAAGCCCCAA GAACGTAACC TATAGTGCCT TGTTGTTACA CTAGGCCAAA 



410 420 430 440 450 

***** 

TGTCTCAAAA CCGAAACTTG GTCCTTCTTC CATACTCCGA ACTCTGATGT 
ACAGAGTTTT GGCTTTGAAC CAGGAAGAAG GTATGAGGCT TGAGACTACA 

460 470 480 490 500 

***** 

TTTCTCAGGA TTAGTCAGAT ACGAAGGGAA GCTAGGTGCT ATTCGTCAGT 
AAAGAGTCCT AATCAGTCTA TGCTTCCCTT CGATCCACGA TAAGCAGTCA 

510 520 530 540 550 

***** 

GGACAAACAA AGATCAAGAA GATGTTCACG AGTTATGGGT TTTAAAGAGC 



WO 98/06748 PCIYUS97/13994 

5/34 



CCTGTTTGTT TCTAGTTCTT CTACAAGTGC TCAATACCCA AAATTTCTCG 
560 570 580 590 600 

* * * * * 

AGTTTTGAAA AGTCGTGGGT TAAAGTGAAA GATATTAAAA GCATTGGAGT 
TCAAAACTTT TCAGCACCCA ATTTCACTTT CTATAATTTT CGTAACCTCA 

610 620 630 640 650 

***** 

AGATTTGATT ACGTGGACTC CAAGCAACGA CGTTGTATTG TTTCGTAGTA 
TCTAAACTAA TGCACCTGAG GTTCGTTGCT GCAACATAAC AAAGCATCAT 



660 670 680 690 700 

***** 

GTGATCGTGG TTGCCTCTAC AACATAAACG CAGAGAAGTT GAATTTAGTT 
CACTAGCACC AACGGAGATG TTGTATTTGC GTCTCTTCAA CTTAAATCAA 

710 720 730 740 750 

***** 

TATGCAAAAA AAGAGGGATC TGATTGTTCT TTCGTTTGTT TTCCGTTTTG 
ATACGTTTTT TTCTCCCTAG ACTAACAAGA AAGCAAACAA AAGGCAAAAC 



760 770 780 790 800 

***** 

TTCTGATTAC GAGAGGGTTG ATCTGAACGG AAGAAGCAAC GGGCCGACAC 
AAGACTAATG CTCTCCCAAC TAGACTTGCC TTCTTCGTTG CCCGGCTGTG 

810 820 630 640 850 

***** 

TTTAAAAAAA AAATAAAAAA AATGGGCCGA CAAATGCAAA CGTAGTTGAC 
AAATTTTTTT TTTATTTTTT TTACCCGGCT GTTTACGTTT GCATCAACTG 



860 870 880 890 900 

***** 

AAGGATCTCA AGTCTCAAGT CTCAATTGGC TCGCTCATTG TGGGGCATAA 
TTCCTAGAGT TCAGAGTTCA GAGTTAACCG AGCGAGTAAC ACCCCGTATT 



910 920 930 940 950 

* * * * * 

ATATATCTAG TGATGTTTAA TTGTTTTTTA TAAGGTAAAA AGGAATATTG 
TATATAGATC ACTACAAATT AACAAAAAAT ATTCCATTTT TCCTTATAAC 

960 970 980 990 1000 

***** 

AATTTTGTTT CTTAGGTTTA TGTAATAATA CCAAACATTG TTTTATGAAT 
TTAAAACAAA GAATCCAAAT ACATTATTAT GGTTTGTAAC AAAATACTTA 



1010 1020 1030 1040 1050 

***** 

ATTTAATCTG ATTTTTTGGC TAGTTATTTT ATTATATCAA GGGTTCCTGT 
TAAATTAGAC TAAAAAACCG ATCAATAAAA TAATATAGTT CCCAAGGACA 



1060 1070 1080 1090 1100 



WO 98/06748 



PCT/US97/13994 



6/34 

TTATAGTTGA AAACAGTTAC TG TAT AG AAA ATAGTGTCCC AATTTTCTCT 
AATATCAACT TTTGTCAATG ACATATCTTT TATCACAGGG TTAAAAGAGA 

1110 1120 1130 1140 1150 

***** 
CTTAAATAAT ATATTAGTTA ATAAAAGATA TTTTAATATA TTAGATATAC 
GAATTTATTA TATAATCAAT TATTTTCTAT AAAATTATAT AATCTATATG 

1160 1170 1180 1190 1200 

***** 
AATAATATCT AAAGCAACAC ATATTTAGAC ACAACACGTA ATATCTTACT 
TTATTATAGA TTTCGTTGTG TATAAATCTG TGTTGTGCAT TATAGAATGA 

1210 1220 1230 1240 1250 

***** 

ATTGTTTACA TATATTTATA GCTTACCAAT ATAACCCGTA TCTATGTTTT 
TAACAAATGT ATATAAATAT CGAATGGTTA TATTGGGCAT AGATACAAAA 

1260 1270 1280 1290 1300 

***** 

ATAAGCTTTT ATACAATATA TGTACGGTAT GCTGTCCACG TATATATATT 
TATTCGAAAA TATGTTATAT ACATGCCATA CGACAGGTGC ATATATATAA 

1310 1320 1330 1340 1350 

***** 

CTCCAAAAAA AACGCATGGT ACACAAAATT TATTAAATAT TTGGCAATTG 
GAGGTTTTTT TTGCGTACCA TGTGTTTTAA ATAATTTATA AACCGTTAAC 

1360 1370 1380 1390 X4O0 

***** 

GGTGTTTATC TAAAGTTTAT CACAATATTT ATCAACTATA ATAGATGGTA 
CCACAAATAG ATTTCAAATA GTGTTATAAA TAGTTGATAT TATCTACCAT 

1410 1420 1430 1440 1450 

***** 

GAAGATAAAA AAATTATATC AGATTGATTC AATTAAATTT TATAATATAT 
CTTCTATTTT TTTAATATAG TCTAACTAAG TTAATTTAAA ATATTATATA 

1460 1470 1480 1490 1500 

***** 

CATTTTAAAA AATTAATTAA AAGAAAACTA TTTCATAAAA TTGTTCAAAA 
GTAAAATTTT TTAATTAATT TTCTTTTGAT AAAGTATTTT AACAAGTTTT 

1510 1520 1530 1540 1550 

***** 

GATAATTAGT AAAATTAATT AAATATGTGA TGCTATTGAA TTATAGAGAG 
CTATTAATCA TTTTAATTAA TTTATACACT ACGATAACTT AATATCTCTC 

1560 1570 1580 1590 1600 

***** 

TTATTGTAAA TTTACTTAAA ATCATACAAA TCTTATCCTA ATTTAACTTA 
AATAACATTT AAATGAATTT TAGTATCTTT AGAATAGGAT TAAATTGAAT 

1610 1620 1630 1640 1650 
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TCATTTAAGA AATACAAAAG TAAAAAACGC GGAAAGCAAT AATTTATTTA 
AGTAAATTCT TTATGTTTTC ATTTTTTGCG CCTTTCGTTA TTAAATAAAT 

1660 1670 1680 1690 1700 

* * * * * 

CCTTATTATA ACTCCTATAT AAAGTACTCT GTTTATTCAA CATAATCTTA 
GGAATAATAT TGAGGATATA TTTCATGAGA CAAATAAGTT GTATTAGAAT 

1710 1720 1730 1740 1750 

* * * * * 

CGTTGTTGTA TTCATAGGCA TCTTTAACCT ATCTTTTCAT TTTCTGATCT 
GCAACAACAT AAGTATCCGT AGAAATTGGA TAGAAAAGTA AAAGACTAGA 

1760 1770 1780 1790 1800 

***** 

CGATCGTTTT CGATCCAACA AAATGAGTCT ACCGGTGAGG AACCAAGAGG 
GCTAGCAAAA GCTAGGTTGT TTTACTCAGA TGGCCACTCC TTGGTTCTCC 

1810 1820 1830 1840 1850 

***** 

TGATTATGCA GATTCCTTCT TCTTCTCAGT TTCCAGCAAC ATCGAGTCCG 
ACTAATACGT CTAAGGAAGA AGAAGAGTCA AAGGTCGTTG TAGCTCAGGC 

1860 1870 1880 1890 1900 

***** 

GAAAACACCA ATCAAGTGAA GGATGAGCCA AATTTGTTTA GACGTGTTAT 
CTTTTGTGGT TAGTTCACTT CCTACTCGGT TTAAACAAAT CTGCACAATA 

1910 1920 1930 1940 1950 

***** 

GAATTTGCTT TTACGTCGTA GTTATTGAAA AAGCTGATTT ATCGCATGAT 
CTTAAACGAA AATGCAGCAT CAATAACTTT TTCGACTAAA TAGCGTACTA 

1960 1970 1980 1990 2000 

***** 

TCAGAACGAG AAGTTGAAGG CAAATAACTA AAGAAGTCTT TTATATGTAT 
AGTCTTGCTC TTCAACTTCC GTTTATTGAT TTCTTCAGAA AATATACATA 

2010 2020 2030 2040 2050 

***** 

ACAATAATTG TTTTTAAATC AAATCCTAAT TAAAAAAATA TATTCATTAT 
TGTTATTAAC AAAAATTTAG TTTAGGATTA ATT TT T TT AT ATAAGTAATA 

2060 2070 2080 2090 2100 

***** 

GACTTTCATG TTTTTAATGT AATTTATTCC TATATCTATA ATGATTTTTG 
CTGAAAGTAC AAAAATTACA TTAAATAAGG ATATAGATAT TACTAAAAAC 

2110 2120 2130 2140 2150 

***** 

TTGTGAAGAG CGTTTTCATT TGCTATAGAA CAAGGAGAAT AGTTCCAGGA 
AACACTTCTC GCAAAAGTAA ACGATATCTT GTTCCTCTTA TCAAGGTCCT 
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2160 2170 2180 2190 2200 

***** 

AATATTCGAC TTGATTTAAT TATAGTGTAA ACATGCTGAA CACTGAAAAT 
TTATAAGCTG AACTAAATTA ATATCACATT TGTACGACTT GTGACTTTTA 

2210 2220 2230 2240 2250 

* * * * * 

TACTTTTTCA ATAAACGAAA AATATAATAT ACATTACAAA ACTTATGTGA 
ATGAAAAAGT TATTTGCTTT TTATATTATA TGTAATGTTT TGAATACACT 

2260 2270 2280 2290 2300 

***** 

ATAAAGCATG AGACTTAATA TACGTTCCCT TTATCATTTT ACTTCAAAGA 
TATTTCGTAC TCTGAATTAT ATGCAAGGGA AATAGTAAAA TGAAGTTTCT 

2310 2320 2330 2340 2350 

***** 

AAATAAACAG AAATGTAACT TTCACATGTA AATCTAATTC TTAAATTTAA 
TTTATTTGTC TTTACATTGA AAGTGTACAT TTAGATTAAG AATTTAAATT 

2360 2370 2380 2390 2400 

***** 

AAAATAATAT TTATATATTT ATATGAAAAT AACGAACCGG ATGAAAAATA 
TTTTATTATA AATATATAAA TATACTTTTA TTGCTTGGCC TACTTTTTAT 

2410 2420 2430 2440 2450 

***** 

AATTTTATAT ATTTATATCA TCTCCAAATC TAGTTTGGTT CAGGGGCTTA 
TTAAAATATA TAAATATAGT AGAGGTTTAG ATCAAACCAA GTCCCCGAAT 

2460 2470 2480 2490 2500 

***** 

CCGAACCGGA TTGAACTTCT CAT AT AC AAA AATTAGCAAC ACAAAATGTC 
GGCTTGGCCT AACTTGAAGA GTATATGTTT TTAATCGTTG TGTTTTACAG 

2510 2520 2530 2540 2550 

***** 

TCCGGTATAA ATACTAACAT TTATAACCCG AACCGGTTTA GCTTCCTGTT 
AGGCCATATT TATGATTGTA AATATTGGGC TTGGCCAAAT CGAAGGACAA 

2560 2570 2580 2590 2600 

***** 

ATATCTTTTT AAAAAAGATC TCTGACAAAG ATTCCTTTCC TGGAAATTTA 
TATAGAAAAA TTTTTTCTAG AGACTGTTTC TAAGGAAAGG ACCTTTAAAT 

2610 2620 2630 2640 2650 

* * * * • 

CCGGTTTTGG TGAAATGTAA ACCGTGGGAC GAGGATGCTT CTTCATATCT 
GGCCAAAACC ACTTTACATT TGGCACCCTG CTCCTACGAA GAAGTATAGA 

2660 2670 2680 2690 2700 

* * * * 

CACCACCACT CTCGTTGACT GGACTTGGCT CTGCTCGTCA ATGGTTATCT 
GTGGTGGTGA GAGCAACTGA CCTGAACCGA GACGAGCAGT TACCAATAGA 
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2710 2720 2730 2740 2750 

* * * * * 

TCGATCTTAA ACCAAATCCA GTTGATAAGG TCTCTTCGTT GATTAGCAGA 
AGCTAGAATT TGGTTTAGGT CAACTATTCC AGAGAAGCAA CTAATCGTCT 

2760 2770 2780 2790 2800 

* * * * » 

GATCTCTTTA ATTTGTGAAT TTCAATTCAT CGGAACCTGT TGATGGACAC 
CTAGAGAAAT TAAACACTTA AAGTTAAGTA GCCTTGGACA ACTACCTGTG 

2610 2820 2830 2840 2850 

* * * * * 

CACCATTGAT GGATTCGCCG ATTCTTATGA AATCAGCAGC ACTAGTTTCG 
GTGGTAACTA CCTAAGCGGC TAAGAATACT TTAGTCGTCG TGATCAAAGC 

2860 2870 2880 2890 2900 

* * * * * 

TCGCTACCGA TAACACCGAC TCCTCTATTG TTTATCTGGC CGCCGAACAA 
AGCGATGGCT ATTGTGGCTG AGGAGATAAC AAATAGACCG GCGGCTTGTT 

2910 2920 2930 2940 2950 

***** 

GTACTCACCG GACCTGATGT ATCTGCTCTG CAATTGCTCT CCAACAGCTT 
CATGAGTGGC CTGGACTACA TAGACGAGAC GTTAACGAGA GGTTGTCGAA 

2960 2970 2980 2990 3000 

***** 

CGAATCCGTC TTTGACTCGC CGGATGATTT CTACAGCGAC GCTAAGCTTG 
GCTTAGGCAG AAACTGAGCG GCCTACTAAA GATGTCGCTG CGATTCGAAC 

3010 3020 3030 3040 3050 

***** 

TTCTCTCCGA CGGCCGGGAA GTTTCTTTCC ACCGGTGCGT TTTGTCAGCG 
AAGAGAGGCT GCCGGCCCTT CAAAGAAAGG TGGCCACGCA AAACAGTCGC 

3060 3070 3080 3090 3100 

* * * * • 

AGAAGCTCTT TCTTCAAGAG CGCTTTAGCC GCCGCTAAGA AGGAGAAAGA 
TCTTCGAGAA AGAAGTTCTC GCGAAATCGG CGGCGATTCT TCCTCTTTCT 

3110 3120 3130 3140 3150 

* * * * • 

CTCCAACAAC ACCGCCGCCG TGAAGCTCGA GCTTAAGGAG ATTGCCAAGG 
GAGGTTGTTG TGGCGGCGGC ACTTCGAGCT CGAATTCCTC TAACGGTTCC 

3160 3170 3180 3190 3200 

***** 

ATTACGAAGT CGGTTTCGAT TCGGTTGTGA CTGTTTTGGC TTATGTTTAC 
TAATGCTTCA GCCAAAGCTA AGCCAACACT GACAAAACCG AATACAAATG 

3210 3220 3230 3240 3250 

* * * * * 

AGCAGCAGAG TGAGACCGCC GCCTAAAGGA GTTTCTGAAT GCGCAGACGA 
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TCGTCGTCTC ACTCTGGCGG CGGATTTCCT CAAAGACTTA CGCGTCTGCT 

3260 3270 3280 3290 3300 

***** 

GAATTGCTGC CACGTGGCTT GCCGGCCGGC GGTGGATTTC ATGTTGGAGG 
CTTAACGACG GTGCACCGAA CGGCCGGCCG CCACCTAAAG TACAACCTCC 

3310 3320 3330 3340 3350 

***** 

TTCTCTATTT GGCTTTCATC TTCAAGATCC CTGAATTAAT TACTCTCTAT 
AAGAGATAAA CCGAAAGTAG AAGTTCTAGG GACTTAATTA ATGAGAGATA 

3360 3370 3380 3390 3400 

***** 

CAGGTAAAAC ACCATCTGCA TTAAGCTATG GTTACACATT CATGAATATG 
GTCCATTTTG TGGTAGACGT AATTCGATAC CAATGTGTAA GTACTTATAC 

3410 3420 3430 3440 3450 

***** 

TTCTTACTTG AGTACTTGTA TTTGTATTTC AGAGGCACTT ATTGGACGTT 
AAGAATGAAC TCATGAACAT AAACATAAAG TCTCCGTGAA TAACCTGCAA 

3460 3470 3480 3490 3500 

***** 

GTAGACAAAG TTGTTATAGA GGACACATTG GTTATACTCA AGCTTGCTAA 
CATCTGTTTC AACAATATCT CCTGTGTAAC CAATATGAGT TCGAACGATT 

3510 3520 3530 3540 3550 

***** 

TATATGTGGT AAAGCTTGTA TGAAGCTATT GGATAGATGT AAAGAGATTA 
ATATACACCA TTTCGAACAT ACTTCGATAA CCTATCTACA TTTCTCTAAT 

3560 3570 3580 3590 3600 

***** 

TTGTCAAGTC TAATGTAGAT ATGGTTAGTC TTGAAAAGTC ATTGCCGGAA 
AACAGTTCAG ATTACATCTA TACCAATCAG AACTTTTCAG TAACGGCCTT 

3610 3620 3630 3640 3650 

***** 

GAGCTTGTTA AAGAGATAAT TGATAGACGT AAAGAGCTTG GTTTGGAGGT 
CTCGAACAAT TTCTCTATTA ACTATCTGCA TTTCTCGAAC CAAACCTCCA 

3660 3670 3680 3690 3700 

***** 

ACCTAAAGTA AAGAAACATG TCTCGAATGT ACATAAGGCA CTTGACTCGG 
TGGATTTCAT TTCTTTGTAC AGAGCTTACA TGTATTCCGT GAACTGAGCC 

3710 3720 3730 3740 3750 

* * * * * 

ATGATATTGA GTTAGTCAAG TTGCTTTTGA AAGAGGATCA CACCAATCTA 
TACTATAACT CAATCAGTTC AACGAAAACT TTCTCCTAGT GTGGTTAGAT 

3760 3770 3780 3790 3800 
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GATGATGCGT GTGCTCTTCA TTTCGCTGTT GCATATTGCA ATGTGAAGAC 
CTACTACGCA CACGAGAAGT AAAGCGACAA CGTATAACGT TACACTTCTG 



3810 3820 3830 3840 3850 

***** 

CGCAACAGAT CTTTTAAAAC TTGATCTTGC CGATGTCAAC CATAGGAATC 
GCGTTGTCTA GAAAATTTTG AACTAGAACG GCTACAGTTG GTATCCTTAG 

3860 3870 3880 3890 3900 

***** 

CGAGGGGATA TACGGTGCTT CATGTTGCTG CGATGCGGAA GGAGCCACAA 
GCTCCCCTAT ATGCCACGAA GTACAACGAC GCTACGCCTT CCTCGGTGTT 

3910 3920 3930 3940 3950 

***** 

TTGATACTAT CTCTATTGGA AAAAGGTGCA AGTGCATCAG AAGCAACTTT 
AACTATGATA GAGATAACCT TTTTCCACGT TCACGTAGTC TTCGTTGAAA 



3960 3970 3980 3990 4000 

***** 

GGAAGGTAGA ACCGCACTCA TGATCGCAAA ACAAGCCACT ATGGCGGTTG 
CCTTCCATCT TGGCGTGAGT ACTAGCGTTT TGTTCGGTGA TACCGCCAAC 

4010 4020 4030 4040 4050 

***** 

AATGTAATAA TATCCCGGAG CAATGCAAGC ATTCTCTCAA AGGCCGACTA 
TTACATTATT ATAGGGCCTC GTTACGTTCG TAAGAGAGTT TCCGGCTGAT 

4060 4070 4080 

* * * 

TGTGTAGAAA TACTAGAGCA AGAAGACAAA 
ACACATCTTT ATGATCTCGT TCTTCTGTTT 



4090 4100 
* • 

CGAGAACAAA TTCCTAGAGA 
AAGGATCTC T 



4110 4L20 4130 4140 4150 

***** 

TGTTCCTCCC TCTTTTGCAG TGGCGGCCGA TGAATTGAAG ATGACGCTGC 
ACAAGGAGGG AGAAAACGTC ACCGCCGGCT ACTTAACTTC TACTGCGACG 



4160 4170 4180 4190 4200 

***** 

TCGATCTTGA AAATAGAGGT ATCTATCAAG TCTTATTTCT TATATGTTTG 
AGCTAGAACT TTTATCTCCA TAGATAGTTC AGAATAAAGA ATATACAAAC 



4210 4220 4230 4240 4250 

***** 

AATTAAATTT ATGTCCTCTC TATTAGGAAA CTGAGTGAAC TAATGATAAC 
TTAATTTAAA TACAGGAGAG ATAATCCTTT GACTCACTTG ATTACTATTG 



4260 4270 4280 4290 4300 

***** 

TATTCTTTGT GTCGTCCACT GTTTAGTTGC ACTTGCTCAA CGTCTTTTTC 
ATAAGAAACA CAGCAGGTGA CAAATCAACG TGAACGAGTT GCAGAAAAAG 



4310 4320 4330 4340 4350 
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CAACGGAAGC ACAAGCTGCA ATGGAGATCG CCGAAATGAA GGGAACATGT 
GTTGCCTTCG TGTTCGACGT TACCTCTAGC GGCTTTACTT CCCTTGTACA 

4360 4370 4380 4390 4400 

* * * * * 

GAGTTCATAG TGACTAGCCT CGAGCCTGAC CGTCTCACTG GTACGAAGAG 
CTCAAGTATC ACTGATCGGA GCTCGGACTG GCAGAGTGAC CATGCTTCTC 

4410 4420 4430 4440 4450 

***** 

AACATCACCG GGTGTAAAGA TAGCACCTTT CAGAATCCTA GAAGAGCATC 
TTGTAGTGGC CCACATTTCT ATCGTGGAAA GTCTTAGGAT CTTCTCGTAG 

4460 4470 4480 4490 4500 

* * * * * 

AAAGTAGACT AAAAGCGCTT TCTAAAACCG GTATGGATTC TCACCCACTT 
TTTCATCTGA TTTTCGCGAA AGATTTTGGC CATACCTAAG AGTGGGTGAA 

4510 4520 4530 4540 4550 

***** 

CATCGGACTC CTTATCACAA AAAACAAAAC TAAATGATCT TTAAACATGG 
GTAGCCTGAG GAATAGTGTT TTTTGTTTTG ATTTACTAGA AATTTGTACC 

4560 4570 4580 4590 4600 

****•# 

TTTTGTTACT TGCTGTCTGA CCTTGTTTTT TTATCATCAG TGGAACTCGG 
AAAACAATGA ACGACAGACT GGAACAAAAA AATAGTAGTC ACCTTGAGCC 

4610 4620 4630 4640 4650 

***** 

GAAACGATTC TTCCCGCGCT GTTCGGCAGT GCTCGACCAG ATTATGAACT 
CTTTGCTAAG AAGGGCGCGA CAAGCCGTCA CGAGCTGGTC TAATACTTGA 

4660 4670 4680 4690 4700 

***** 

GTGAGGACTT GACTCAACTG GCTTGCGGAG AAGACGACAC TGCTGAAGAA 
CACTCCTGAA CTGAGTTGAC CGAACGCCTC TTCTGCTGTG ACGACTTCTT 

4710 4720 4730 4740 4750 

* * * * * 

ACGACTACAA AAGAAGCAAA GGTACATGGA AATACAAGAG ACACTAAAGA 
TGCTGATGTT TTCTTCGTTT CCATGTACCT TTATGTTCTC TGTGATTTCT 

4760 4770 4780 4790 4B00 

***** 

AGGCCTTTAG TGAGGACAAT TTGGAATTAG GAAATTCGTC CCTGACAGAT 
TCCGGAAATC ACTCCTGTTA AACCTTAATC CTTTAAGCAG GGACTGTCTA 

4810 4820 4830 4840 4850 

* ♦ * * • 

TCGACTTCTT CCACATCGAA ATCAACCGGT GGAAAGAGGT CTAACCGTAA 
AGCTGAAGAA GGTGTAGCTT TAGTTGGCCA CCTTTCTCCA GATTGGCATT 
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4860 4870 4880 4890 4900 

* * * * * 

ACTCTCTCAT CGTCGTCGGT GAGACTCTTG CCTCTTAGTG TAATTTTTGC 
TGAGAGAGTA GCAGCAGCCA CTCTGAGAAC GGAGAATCAC ATTAAAAACG 

4910 4920 4930 4940 4950 

* » * * * 

TGTACCATAT AATTCTGTTT TCATGATGAC TGTAACTGTT TATGTCTATC 
ACATGGTATA TTAAGACAAA AGTACTACTG ACATTGACAA ATACAGATAG 



FT* 



4960 4970 4980 4990 5000 

* * + * * 

GTTGGCGTCA TATAGTTTCG CTCTTCGTTT TGCATCCTGT GTATTATTGC 

CAACCGCAGT ATATCAAAGC GAGAAGCAAA ACGTAGGACA CATAATAACG 



5010 5020 5030 5040 5050 

* * * ♦ * 

TGCAGGTGTG CTTCAAACAA ATGTTGTAAC AATTTGAACC AATGGTATAC 
ACGTCCACAC GAAGTTTGTT TACAACATTG TTAAACTTGG TTACCATATG 

5060 5070 5080 5090 5100 

* » * • * 

AGATTTGTAA TATATATTTA TGTACATCAA CAATAACCCA TGATGGTGTT 
TCTAAACATT ATATATAAAT ACATGTAGTT GTTATTGGGT ACTACCACAA 



5110 5120 5130 5140 5150 

***** 

ACAGAGTTGC TAGAATCAAA GTGTGAAATA ATGTCAAATT GTTCATCTGT 
TGTCTCAACG ATCTTAGTTT CACACTTTAT TACAGTTTAA CAAGTAGACA 



5160 5170 5180 5190 520O 

***** 

TGGATATTTT CCACCAAGAA CCAAAAGAAT ATTCAAGTTC CCTGAACTTC 
ACCTATAAAA GGTGGTTCTT GGTTTTC7TA TAAGTTCAAG GGACTTGAAG 



5210 5220 5230 5240 5250 

***** 

TGGCAACATT CATGTTATAT GTATCTTCCT AATTCTTCCT TTAACCTTTT 
ACCGTTGTAA GTACAATATA CATAGAAGGA TTAAGAAGGA AATTGGAAAA 

5260 5270 5280 5290 5300 

***** 

GTAACTCGAA TTACACAGCA AGTTAGTTTC AGGTCTAGAG ATAAGAGAAC 
CATTGAGCTT AATGTGTCGT TCAATCAAAG TCCAGATCTC TATTCTCTTG 

5310 5320 5330 5340 5350 

***** 

ACTGAGTGGG CGTGTAAGGT GCATTCTCCT AGTCAGCTCC ATTGCATCCA 
TGACTCACCC GCACATTCCA CGTAAGAGGA TCAGTCGAGG TAACGTAGGT 

5360 5370 5380 5390 5400 

***** 

ACATTTGTGA ATGACACAAG TTAACAATCC TTTGCACCAT TTCTGGGTGC 
TGTAAACACT TACTGTGTTC AATTGTTAGG AAACGTGGTA AAGACCCACG 
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5410 5420 5430 5440 5450 

***** 

ATACATGGAA ACTTCTTCGA TTGAAACTTC CCACATGTGC AGGTGCGTTC 
TATGTACCTT TGAAGAAGCT AACTTTGAAG GGTGTACACG TCCACGCAAG 

5460 5470 5480 5490 5500 

***** 

GCTGTCACTG ATAGACCAAG AGACTGAAAG CTTTCACAAA TTGCCCTCAA 
CGACAGTGAC TATCTGGTTC TCTGACTTTC GAAAGTGTTT AACGGGAGTT 

5510 5520 5530 5540 5550 

***** 

ATCTTCTGTT TCTATCGTCA TGACTCCATA TCTCCGACCA CTGGTCATGA 
TAGAAGACAA AGATAGCAGT ACTGAGGTAT AGAGGCTGGT GACCAGTACT 

5560 5570 55B0 5590 5600 

***** 

GCCAGAGCCC ACTGATTTTG AGGGAATTGG GCTAACCATT TCCGAGCTTC 
CGGTCTCGGG TGACTAAAAC TCCCTTAACC CGATTGGTAA AGGCTCGAAG 

5610 5620 5630 5640 5650 

***** 

TGAGTCCTTC TTTTTGATGT CCTTTATGTA GGAATCAAAT TCTTCCTTCT 
ACTCAGGAAG AAAAACTACA GGAAATACAT CCTTAGTTTA AGAAGGAAGA 



5660 5670 5680 5690 5700 

***** 

GACTTGTGGA TCCAGCCTGC TTCACAAGGC TCACCAGGTT GTAGTCTCCA 
CTGAACACCT AGGTCGGACG AAGTGTTCCG AGTGGTCCAA CATCAGAGGT 

5710 5720 5730 5740 5750 

***** 

AAAATATCAT GGAATTGTAA GCAAAAACAA TCCAGACAGA ACCTGTGATA 
TTTTATAGTA CCTTAACATT CGTTTTTGTT AGGTCTGTCT TGGACACTAT 

5760 5770 5780 5790 5800 

***** 

GACCCAAGGT TCTTGCCACA GTGATCCGGG TTCGTTAATA ACAGCAACTA 
CTGGGTTCCA AGAACGGTGT CACTAGGCCC AAGCAATTAT TGTCGTTGAT 

5810 5820 5830 5840 5850 

* * * * * 

TGTCCGGGTG AGGACTGGAG ACGAAGCAAA CGTCTTTCCT TTGTGTTACC 
ACAGGCCCAC TCCTGACCTC TGCTTCGTTT GCAGAAAGGA AACACAATGG 

5860 5870 5880 5890 5900 

***** 

TTCTCTCTGA TATTAGTGAG AAACCAACGC CAACTATCAG TGGACACTTC 
AAGAGAGACT ATAATCACTC TTTGGTTGCG GTTGATAGTC ACCTGTGAAG 

5910 5920 5930 5940 5950 

***** 

TTTGGTAAGC GGAAAGCAAG CGGGAAAAAC AATCATCAGC GTCGAGTCCT 
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AAACCATTCG CCTTTCGTTC GCCCTTTTTG TTAGTAGTCG CAGCTCAGGA 

5960 5970 5980 5990 6000 

***** 

GAGGAAAATC ATCAATTTCA TAGGGGTACT TGCCGTTCAA GTCTTTTGAA 
CTCCTTTTAG TAGTTAAAGT ATCCCCATGA ACGGCAAGTT CAGAAAACTT 

6010 6020 6030 6040 6050 

* * * * * 

TCCACTATGA TCAGAGGTCT ACAGTGTTGA AACCCTTCAA TGGACTGTGG 
AGGTGATACT AGTCTCCAGA TGTCACAACT TTGGGAAGTT ACCTGACACC 

6060 6070 6080 6090 6100 

*#*••** 

AAACGCCCAA AACGCGCCAC CGAAGGATGC AAATTCAGGA TTAGGGAAAA 
TTTGCGGGTT TTGCGCGGTG GCTTCCTACG TTTAAGTCCT AATCCCTTTT 

6110 6120 6130 6140 6150 

***** 

GCTCATATTG CAGTCCACAA GTAGCCCATT AGATGAGTGA AATGCAGCCA 
CGAGTATAAC GTCAGGTGTT CATCGGGTAA TCTACTCACT TTACGTCGGT 

6160 6170 6180 6190 6200 

***** 

ATTAGTTTAG GCAATACTCT GAAACTCTGA TCTTTGATTA CTTCCTGTTC 
TAATCAAATC CGTTATGAGA CTTTGAGACT AGAAACTAAT GAAGGACAAG 

6210 6220 6230 6240 6250 

***** 

TGCTGCCCGC AGCTTTGAAG TTTTAAGCAT GTCACCAAAC TTTTCAACTC 
ACGACGGGCG TCGAAACTTC AAAATTCGTA CAGTGGTTTG AAAAGTTGAG 

6260 6270 6280 6290 6300 

***** 

TGCTGTTAGA GTGGGTTGTA CCCTGATCAG ACACTCAATC TCTTCTGCTG 
ACGACAATCT CACCCAACAT GGGACTAGTC TGTGAGTTAG AGAAGACGAC 

6310 6320 6330 6340 6350 

***** 

CAAATTACAA GTTGAAGTTT TCCGGCTTAA TAGAACAACA AGTATGTGGA 
GTTTAATGTT CAACTTCAAA AGGCCGAATT ATCTTGTTGT TCATACACCT 

6360 6370 6380 6390 6400 

***** 

CCAACTACAC TTAGTTATCT TAACAAGTCC ATGTTCTTCT ATTCAATCTG 
GGTTGATGTG AATCAATAGA ATTGTTCAGG TACAAGAAGA TAAGTTAGAC 

6410 6420 6430 6440 6450 

***** 

CCCGACGCGA CCAATTGCAT TTCCATCTGA TGCATTTAAA CGTATACTCG 
GGGCTGCGCT GGTTAACGTA AAGGTAGACT ACGTAAATTT GCATATGAGC 

6460 6470 6480 6490 6500 
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TCCTTCTCAA TCTCTTGTAC TACACACTTT TGCTGCCCTC TAATGGAACA 
AGGAAGAGTT AGAGAACATG ATGTGTGAAA ACGACGGGAG ATTACCTTGT 

6510 6520 6530 6540 6550 

***** 

CCAGTCCACC GCCTTCTTCA GCTCATCCCT ATCTTTAAAA CACAACCCTA 
GGTCAGGTGG CGGAAGAAGT CGAGTAGGGA TAGAAATTTT GTGTTGGGAT 

6560 6570 6580 6590 6600 

***** 

CACGCAATTC ATGATCATCA ATCCACAAAC TAGACAAAGT ACACTGTTTT 
GTGCGTTAAG TACTAGTAGT TAGGTGTTTG ATCTGTTTCA TGTGACAAAA 

6610 6620 6630 6640 6650 

***** 

GAAGCACTCG AATCAACAAC ACCTTTACTT AATAAGCACG CATACGGTAA 
CTTCGTGAGC TTAGTTGTTG TGGAAATGAA TTATTCGTGC GTATGCCATT 

6660 6670 6680 6690 6700 

***** 

TACCTCTAAG CCTGGCACAT TCAAACCTTG TGTGCATCAT CTGAACCCGA 
ATGGAGATTC GGACCGTGTA AGTTTGGAAC ACACGTAGTA GACTTGGGCT 

6710 6720 6730 6740 6750 

***** 

GTTTTTATCC GTTATTTCTC CATCCCCACC TCCACGAGTG CTACCATTTC 
CAAAAATAGG CAATAAAGAG GTAGGGGTGG AGGTGCTCAC GATGGTAAAG 

6760 6770 6780 6790 6800 

***** 

CGAAGTCAGA ATTTTCCTCG TCTTCAATCC ACCCGTTACT GTTACCCACT 
GCTTCAGTCT TAAAAGGAGC AGAAGTTAGG TGGGCAATGA CAATGGGTGA 

6810 6820 6830 6840 6850 

***** 

CCCTGAACCT CTAAACCATT ATCTCTCTCT ACTTTCACAG ATGCATGTGA 
GGGACTTGGA GATTTGGTAA TAGAGAGAGA TGAAAGTGTC TACGTACACT 

6860 6870 6880 6890 6900 

***** 

CACATAATCA GTAGCTTCTT GGGGTTGTTG CGTCCTCTGT GTATTCGAGG 
GTGTATTAGT CATCGAAGAA CCCCAACAAC GCAGGAGACA CATAAGCTCC 

6910 6920 6930 6940 6950 

***** 

AACTAGCGGG ATATTCTATT ACGGATGAAC AAGCAGCATG ATCAGTAACA 
TTGATCGCCC TATAAGATAA TGCCTACTTG TTCGTCGTAC TAGTCATTGT 

6960 6970 6980 6990 7000 

***** 

TTATCAGATG TCGATTTCAC TTCCAAATAC AACTCCACAT TTCTTATAGA 
AATAGTCTAC AGCTAAAGTG AAGGTTTATG TTGAGGTGTA AAGAATATCT 

7010 7020 7030 7040 7050 
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* * * * * 

AGGATGATAA CTTGGAACTT CAAGCATAGT CTCCAAACTA GTGTCGTTCA 
TCCTACTATT GAACCTTGAA GTTCGTATCA GAG GTTTG AT CACAGCAAGT 

7060 7070 7080 7090 7100 

* * # ♦ * 

CTACATGAAG AAGTAGATAG ATAAAGAGAT CCGGTGAAAC AACTACAGGA 
GATGTACTTC TTCATCTATC TATTTCTCTA GGCCACTTTG TTGATGTCCT 

7110 7120 7130 7140 7150 

* * # * * 

TACTTACCAA AATATATTGA ACACTGATTT CTGCAGCTGC AATCCAAAAA 
ATGAATGGTT TTATATAACT TGTGACTAAA GACGTCGACG TTAGGTTTTT 

7160 7170 7180 7190 7200 

* * * * m 

TTGGATAAAG ACCATTCAAC AATGTACTTA ACGCAGTCTT TTGCCTAACC 
AACCTATTTC TGGTAAGTTG TTACATGAAT TGCGTCAGAA AACGGATTGG 

7210 7220 7230 7240 7250 

* * # * V 

TTGACCGTTT TAGGAGTGGA TCCTTCATAG TAAACACCAT CAGGACCATA 
AACTGGCAAA ATCCTCACCT AGGAAGTATC ATTTGTGGTA GTCCTGGTAT 

7260 7270 7280 7290 7300 

* * # # * 

CTTGGTAGAA CCTTTCTCTC AAGGTTTCCA TCGCCATGAC CATAACAGTC 
GAACCATCTT GGAAAGAGAG TTCCAAAGGT AGCGGTACTG GTATTGTCAG 

7310 7320 7330 7340 7350 

* * * * *■ 

CTGCAGTGAA TTCTAAGAAA AATGTAAAAA ATTTTGGCCT AAACTCATAA 
GACGTCACTT AAGATTCTTT TTACATTTTT TAAAACCGGA TTTGAGTATT 

7360 7370 7380 7390 7400 

* * * ♦ * 

TTCTTAACAT ACGAAACCAT GGAGAACTCC ATGTCTAAAA AATAAAGGCT 
AAGAATTGTA TGCTTTGGTA CCTCTTGAGG TACAGATTTT TTATTTCCGA 

7410 7420 7430 7440 7450 

tt * * * » 

AAAGCTTTTT GGCGACAGAA GCAGATAAAT CCATTCAAAA CACATAAACT 
TTTCGAAAAA CCGCTGTCTT CGTCTATTTA GGTAAGTTTT GTGTATTTGA 

7460 7470 7480 7490 7500 

* # * * * 

CTAAACAATA AACAGTGATA CTCAATACTA AGACTTGTAA AGGTCTACGT 
GATTTGTTAT TTGTCACTAT GAGTTATGAT TCTGAACATT TCCAGATGCA 

7510 7520 7530 7540 

* » ♦ • 

AACTCAAAAC TGGAGAATTG TCAGATCGGG TGTGGCTAGT AGAAGCTT 
TTGAGTTTTG ACCTCTTAAC AGTCTAGCCC ACACCGATCA TCTTCGAA 
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10 20 30 40 50 

***** 

TCGATCTTTA ACCAAATCCA GTTGATAAGG TCTCTTCGTT GATTAGCAGA 
AGCTAGAAAT TGGTTTAGGT CAACTATTCC AGAGAAGCAA CTAATCGTCT 

60 70 80 90 100 

***** 

GATCTCTTTA ATTTGTGAAT TTCAATTCAT CGGAACCTGT TGATGGACAC 
CTAGAGAAAT TAAACACTTA AAGTTAAGTA GCCTTGGACA ACTACCTGTG 

M D T> 

110 120 130 140 150 

***** 

CACCATTGAT GGATTCGCCG ATTCTTATGA AATCAGCAGC ACTAGTTTCG 
GTGGTAACTA CCTAAGCGGC TAAGAATACT TTAGTCGTCG TGATCAAAGC 
TID G F A DSYE ISS TSF> 

160 170 180 190 200 

***** 

TCGCTACCGA TAACACCGAC TCCTCTATTG TTTATCTGGC CGCCGAACAA 
AGCGATGGCT ATTGTGGCTG AGGAGATAAC AAATAGACCG GCGGCTTGTT 
VATD NTD SSI V Y L A A E Q> 

210 220 230 240 250 

***** 

GTACTCACCG GACCTGATGT ATCTGCTCTG CAATTGCTCT CCAACAGCTT 
CATGAGTGGC CTGGACTACA TAGACGAGAC GTTAACGAGA GGTTGTCGAA 
VLT GPDV SAL QLL SNSF> 

260 270 280 290 300 

***** 

CGAATCCGTC TTTGACTCGC CGGATGATTT CTACAGCGAC GCTAAGCTTG 
GCTTAGGCAG AAACTGAGCG GCC TACT AAA GATGTCGCTG CGATTCGAAC 
ESV FDS PDDF Y S D A K L> 



310 320 330 340 350 

***** 

TTCTCTCCGA CGGCCGGGAA GTTTCTTTCC ACCGGTGCGT TTTGTCAGCG 
AAGAGAGGCT GCCGGCCCTT CAAAGAAAGG TGGCCACGCA AAACAGTCGC 
VLSD GRE VSF HRCV LSA> 

360 370 380 390 400 

***** 

AGAAGCTCTT TCTTCAAGAG CGCTTTAGCC GCCGCTAAGA AGGAGAAAGA 
TCTTCGAGAA AGAAGTTCTC GCGAAATCGG CGGCGATTCT TCCTCTTTCT 
R S S FFKS ALA A A K KEKD> 

410 420 430 440 450 

***** 

CTCCAACAAC ACCGCCGCCG TGAAGCTCGA GCTTAAGGAG ATTGCCAAGG 
GAGGTTGTTG TGGCGGCGGC ACTTCGAGCT CGAATTCCTC TAACGGTTCC 
SNN TAA VKLE LKE IAK> 
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460 470 480 490 S0O 

* * * .» * 

ATTACGAAGT CGGTTTCGAT TCGGTTGTGA CTGTTTTGGC TTATGTTTAC 
TAATGCTTCA GCCAAAGCTA AGCCAACACT GACAAAACCG AATACAAATG 
D Y E V GFD S V V TVLA Y V Y> 

510 520 530 540 550 

***** 

AGCAGCAGAG TGAGACCGCC GCCTAAAGGA GTTTCTGAAT GCGCAGACGA 
TCGTCGTCTC ACTCTGGCGG CGGATTTCCT CAAAGACTTA CGCGTCTGCT 
SSR VRPP PKG VSE C A D E> 

560 570 580 590 600 

***** 

GAATTGCTGC CACGTGGCTT GCCGGCCGGC GGTGGATTTC ATGTTGGAGG 

CTTAACGACG GTGCACCGAA CGGCCGGCCG CCACCTAAAG TACAACCTCC 

NCC H V A CRP-A VDF MLE> 

610 620 630 640 650 

* * * * » 

TTCTCTATTT GGCTTTCATC TTCAAGATCC CTGAATTAAT TACTCTCTAT 
AAGAGATAAA CCGAAAGTAG AAGTTCTAGG GACTTAATTA ATGAGAGATA 
VLYL A F I FKI PELI TLY> 

660 670 680 690 700 

* * * * # 

CAGAGGCACT TATTGGACGT TGTAGACAAA GTTGTTATAG AGGACACATT 
GTCTCCGTGA ATAACCTGCA ACATCTGTTT CAACAATATC TCCTGTGTAA 
QRH LLDV VDK VVI EDTL> 

710 720 730 740 750 

***** 

GGTTATACTC AAGCTTGCTA ATATATGTGG TAAAGCTTGT ATGAAGCTAT 
CCAATATGAG TTCGAACGAT TATATACACC ATTTCGAACA TACTTCGATA 
V I L KLA NICG KAC M K L> 

760 770 780 790 800 

***** 

TGGATAGATG TAAAGAGATT ATTGTCAAGT CTAATGTAGA TATGGTTAGT 
ACCTATCTAC ATTTCTCTAA TAACAGTTCA GATTACATCT ATACCAATCA 
LDRC K E I I V K SNVD MVS> 

810 820 830 840 850 

***** 

CTTGAAAAGT CATTGCCGGA AGAGCTTGTT AAAGAGATAA TTGATAGACG 
GAACTTTTCA GTAACGGCCT TCTCGAACAA TTTCTCTATT AACTATCTGC 
LEK SLPE ELV KEI I D R R> 

860 870 880 890 900 

***** 

TAAAGAGCTT GGTTTGGAGG TACCTAAAGT AAAGAAACAT GTCTCGAATG 
ATTTCTCGAA CCAAACCTCC ATGGATTTCA TTTCTTTGTA CAGAGCTTAC 
KEL GLE VPKV KKH V S N> 
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910 920 930 940 950 

* * * * * 

TACATAAGGC ACTTGACTCG GATGATATTG AGTTAGTCAA G TT GC T TTTG 
ATGTATTCCG TGAACTGAGC CTACTATAAC TCAATCAGTT CAACGAAAAC 
VHKA LDS DDI ELVK LLL> 

960 970 980 990 1000 

***** 

AAAGAGGATC ACACCAATCT AGATGATGCG TGTGCTCTTC ATTTCGCTGT 
TTTCTCCTAG TGTGGTTAGA TCTACTACGC ACACGAGAAG TAAAGCGACA 
KED HTNL DDA CAL H F A V> 

1010 1020 1030 1040 1050 

***** 

TGCATATTGC AATGTGAAGA CCGCAACAGA TCTTTTAAAA CTTGATCTTG 
ACGTATAACG TTACACTTCT GGCGTTGTCT AGAAAATTTT GAACTAGAAC 
AYC NVK TATD LLK LDL> 

1060 1070 1080 1090 1100 

***** 

CCGATGTCAA CCATAGGAAT CCGAGGGGAT ATACGGTGCT TCATGTTGCT 
GGCTACAGTT GGTATCCTTA GGCTCCCCTA TATGCCACGA AGTACAACGA 
A D V N HRN PRG YTVL HVA> 

1110 1120 1130 1140 1150 

***** 

GCGATGCGGA AGGAGCCACA ATTGATACTA TCTCTATTGG AAAAAGGTGC 
CGCTACGCCT TCCTCGGTGT TAACTATGAT AGAGATAACC TTTTTCCACG 
AMR KEPQ LIL SLL EKGA> 

1160 1170 1180 1190 1200 

***** 

AAGTGCATCA GAAGCAACTT TGGAAGGTAG AACCGCACTC ATGATCGCAA 
TTCACGTAGT CTTCGTTGAA ACCTTCCATC TTGGCGTGAG TACTAGCGTT 
SAS EAT LEGR TAL MIA> 

1210 1220 1230 1240 1250 

***** 

AACAAGCCAC TATGGCGGTT GAATGTAATA ATATCCCGGA GCAATGCAAG 
TTGTTCGGTG ATACCGCCAA CTTACATTAT TATAGGGCCT CGTTACGTTC 
KQAT M A V ECN NIPE QCK> 

1260 1270 1280 1290 1300 

***** 

CATTCTCTCA AAGGCCGACT ATGTGTAGAA ATACTAGAGC AAGAAGACAA 
GTAAGAGAGT TTCCGGCTGA TACACATCTT TATGATCTCG TTCTTCTGTT 
H S L KGRL CVE I L E QEDK> 

1310 1320 1330 1340 1350 

***** 

ACGAGAACAA ATTCCTAGAG ATGTTCCTCC CTCTTTTGCA GTGGCGGCCG 
TGCTCTTGTT TAAGGATCTC TACAAGGAGG GAGAAAACGT CACCGCCGGC 
REQ IPR DVPP SFA VAA> 
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1360 1370 1380 1390 1400 

***** 

ATGAATTGAA GATGACGCTG CTCGATCTTG AAAATAGAGT TGCACTTGCT 
TACTTAACTT CTACTGCGAC GAGCTAGAAC TTTTATCTCA ACGTGAACGA 
DELK MTL L D L ENRV A L A> 

1410 1420 1430 1440 1450 

***** 

CAACGTCTTT TTCCAACGGA AGCACAAGCT GCAATGGAGA TCGCCGAAAT 
GTTGCAGAAA AAGGTTGCCT TCGTGTTCGA CGTTACCTCT AGCGGCTTTA 
QRL FPTE A Q A AME I A E M> 

1460 1470 1480 1490 1500 

***** 

GAAGGGAACA TGTGAGTTCA TAGTGACTAG CCTCGAGCCT GACCGTCTCA 
CTTCCCTTGT ACACTCAAGT ATCACTGATC GGAGCTCGGA CTGGCAGAGT 
KGT CEF IVTS LEP D R L> 

1510 1520 1530 1540 1550 

***** 

CTGGTACGAA GAGAACATCA CCGGGTGTAA AGATAGCACC TTTCAGAATC 
GACCATGCTT CTCTTGTAGT GGCCCACATT TCTATCGTGG AAAGTCTTAG 
TGTK RTS PGV KIAP F R I> 

1560 1570 1580 1590 1600 

***** 

CTAGAAGAGC ATCAAAGTAG ACTAAAAGCG CTTTCTAAAA CCGTGGAACT 
GATCTTCTCG TAGTTTCATC TGATTTTCGC GAAAGATTTT GGCACCTTGA 
LEE HQSR LKA.-LSK T V E L> 

1610 1620 1630 1640 1650 

***** 

CGGGAAACGA TTCTTCCCGC GCTGTTCGGC AGTGCTCGAC CAGATTATGA 
GCCCTTTGCT AAGAAGGGCG CGACAAGCCG TCACGAGCTG GTCTAATACT 
GKR FFP RCSA VLD Q I M> 

1660 1670 1680 1690 1700 

***** 

ACTGTGAGGA CTTGACTCAA CTGGCTTGCG GAGAAGACGA CACTGCTGAG 
TGACACTCCT GAACTGAGTT GACCGAACGC CTCTTCTGCT GTGACGACTC 
NCED LTQ L A C GED D T A E> 

1710 1720 1730 1740 1750 

***** 

AAACGACTAC AAAAGAAGCA AAGGTACATG GAAATACAAG AGACACTAAA 
TTTGCTGATG TTTTCTTCGT TTCCATGTAC CTTTATGTTC TCTGTGATTT 
KRL QKKQ RYM EIQ ETLK> 

1760 1770 1780 1790 1800 

***** 

GAAGGCCTTT AGTGAGGACA ATTTGGAATT AGGAAATTCG TCCCTGACAG 
CTTCCGGAAA TCACTCCTGT TAAACCTTAA TCCTTTAAGC AGGGACTGTC 
KAF SED NLEL GNS S L T> 
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1810 1820 1830 1840 1850 

* * * * * 

ATTCGACTTC TTCCACATCG AAATCAACCG GTGGAAAGAG GTCTAACCGT 
TAAGCTGAAG AAGGTGTAGC TTTAGTTGGC CACCTTTCTC CAGATTGGCA 
DSTS STS KST GGKR S N R> 

1860 1870 1880 1890 1900 

* * « + * 

AAACTCTCTC ATCGTCGTCG GTGAGACTCT TGCCTCTTAG TGTAATTTTT 
TTTGAGAGAG TAGCAGCAGC CACTCTGAGA ACGGAGAATC ACATTAAAAA 
KLS H R R R *> 



1910 1920 1930 1940 1950 

* » * + * 

GCTGTACCAT ATAATTCTGT TTTCATGATG ACTGTAACTG TTTATGTCTA 
CGACATGGTA TATTAAGACA AAAGTACTAC TGACATTGAC AAATACAGAT 

1960 1970 1980 1990 2000 

***** 

TCGTTGGCGT CATATAGTTT CGCTCTTCGT TTTGCATCCT GTGTATTATT 
AGCAACCGCA GTATATCAAA GCGAGAAGCA AAACGTAGGA CACATAATAA 

2010 2020 2030 2040 2050 

* * * * + 

GCTGCAGGTG TGCTTCAAAC AAATGTTGTA ACAATTTGAA CCAATGGTAT 
CGACGTCCAC ACGAAGTTTG TTTACAACAT TGTTAAACTT GGTTACCATA 

2060 2070 2080 2090 2100 

***** 

ACAGATTTGT AATATATATT TATGTACATC AACAATAAAA AAAAAAAAAA 
TGTCTAAACA TTATATATAA ATACATGTAG TTGTTATTTT TTTTTTTTTT 



AAAA 
TTTT 
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10 20 30 40 50 

***** 

GTGACTTTCT AACTATGGCT GAAATTGCAG AACGAAAAAG ACTTTCCATT 
CACTGAAAGA TTGATACCGA CTTTAACGTC TTGCTTTTTC TGAAAGGTAA 

60 70 80 90 100 

* * * * * 

TTTCACTTGA ATGAAACCCA AAATGGAAAT CTATCTCTCT TCTTCTTCTC 
AAAGTGAACT TACTTTGGGT TTTACCTTTA GATAGAGAGA AGAAGAAGAG 

110 120 130 140 150 

* * * • * 

TTTTACTACC TCCATTTCCA TGGCTTTCCC TCCTCTACCT TCCCTAGCTC 
AAAATGATGG AGGTAAAGGT ACCGAAAGGG AGGAGATGGA AGGGATCGAG 



160 
* 

TTTTCAATTT 
AAAAGTTAAA 

210 

AATTTCTAAG 
TTAAAGATTC 

260 
* 

TAGGACTGCG 
ATCCTGACGC 



170 
* 

CTAGAATATT 
GATCTTATAA 

220 
* 

ACAGAACTTA 
TGTCTTGAAT 

270 

TTTTCTGATT 
AAAAGACTAA 



180 
* 

CTTTTCTTAG 
GAAAAGAATC 

230 
* 

TGTAAGGCGG 
ACATTCCGCC 

280 

CGAATGACAT 
GCTTACTGTA 



190 
* 

TCTGTAATTA 
AGACATTAAT 

240 
* 

CTTTCTGTAA 
GAAAGACATT 

290 

CAGCGGAAGC 
GTCGCCTTCG 



200 

TCTATAGCTC 
AGATATCGAG 

250 

TGGATAATAG 
ACCTATTATC 

300 
* 

AGTAGTATAT 
TCATCATATA 



310 320 330 340 350 

***** 

GCTGCATCGG CGGCGGCATG ACTGAATTTT TCTCGCCGGA GACTTCGCCG 
CGACGTAGCC GCCGCCGTAC TGACTTAAAA AGAGCGGCCT CTGAAGCGGC 

360 370 380 390 400 

***** 

GCGGAGATCA CTTCACTGAA ACGCCTATCG GAAACACTGG AATCTATCTT 
CGCCTCTAGT GAAGTGACTT TGCGGATAGC CTTTGTGACC TTAGATAGAA 

410 420 430 440 450 

***** 

CGATGCGTCT TTGCCGGAGT TTGACTACTT CGCCGACGCT AAGCTTGTGG 
GCTACGCAGA AACGGCCTCA AACTGATGAA GCGGCTGCGA TTCGAACACC 

460 470 480 490 500 

***** 

TTTCCGGCCC GTGTAAGGAA ATTCCGGTGC ACCGGTGCAT TTTGTCGGCG 
AAAGGCCGGG CACATTCCTT TAAGGCCACG TGGCCACGTA AAACAGCCGC 

510 520 530 540 550 

***** 

AGGAGTCCGT TCTTTAAGAA TTTGTTCTGC GGTAAAAAGG AGAAGAATAG 
TCCTCAGGCA AGAAATTCTT AAACAAGACG CCATTTTTCC TCTTCTTATC 



WO 98/06748 



25/34 



PCT/US97/13W4 



560 570 580 590 600 

***** 

TAGTAAGGTG GAATTGAAGG AGGTGATGAA AGAGCATGAG GTGAGCTATG 
ATCATTCCAC CTTAACTTCC TCCACTACTT TCTCGTACTC CACTCGATAC 



610 620 630 640 650 

***** 

ATGCTGTAAT GAGTGTATTG GCTTATTTGT ATAGTGGTAA AGTTAGGCCT 
TACGACATTA CTCACATAAC CGAATAAACA TATCACCATT TCAATCCGGA 



660 670 680 690 700 

***** 

TCACCTAAAG ATGTGTGTGT TTGTGTGGAC AATGACTGCT CTCATGTGGC 
AGTGGATTTC TACACACACA AACACACCTG TTACTGACGA GAGTACACCG 

710 720 730 740 750 

***** 

TTGTAGGCCA GCTGTGGCAT TCCTGGTTGA GGTTTTGTAC ACATCATTTA 
AACATCCGGT CGACACCGTA AGGACCAACT CCAAAACATG TGTAGTAAAT 

760 770 780 790 800 

***** 

CCTTTCAGAT CTCTGAATTG GTTGACAAGT TTCAGAGACA CCTACTGGAT 
GGAAAGTCTA GAGACTTAAC CAACTGTTCA AAGTCTCTGT GGATGACCTA 

810 820 830 840 850 

***** 

ATTCTTGACA AAACTGCAGC AGACGATGTA ATGATGGTTT TATCTGTTGC 

TAAGAACTGT TTTGACGTCG TCTGCTACAT TACTACCAAA ATAGACAACG 



860 870 880 890 900 

***** 

AAACATTTGT GGTAAAGCAT GCGAGAGATT GCTTTCAAGC TGCATTGAGA 
TTTGTAAACA CCATTTCGTA CGCTCTCTAA CGAAAGTTCG ACGTAACTCT 



910 920 930 940 950 

***** 

TTATTGTCAA GTCTAATGTT GATATCATAA CCCTTGATAA AGCCTTGCCT 
AATAACAGTT CAGATTACAA CTATAGTATT GGGAACTATT TCGGAACGGA 

960 970 980 990 1000 

***** 

CATGACATTG TAAAACAAAT TACTGATTCA CGAGCGGAAC TTGGTCTACA 
GTACTGTAAC ATTTTGTTTA ATGACTAAGT GCTCGCCTTG AACCAGATGT 

1010 1020 1030 1040 1050 

***** 

AGGGCCTGAA AGCAACGGTT TTCCTGATAA ACATGTTAAG AGGATACATA 
TCCCGGACTT TCGTTGCCAA AAGGACTATT TGTACAATTC TCCTATGTAT 

1060 1070 1080 1090 1100 

***** 

GGGCATTGGA TTCTGATGAT GTTGAATTAC TACAAATGTT GCTAAGAGAG 
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CCCGTAACCT AAGACTACTA CAACTTAATG ATGTTTACAA CGATTCTCTC 

1110 1120 1130 1140 1150 

***** 

GGGCATACTA CCCTAGATGA TGCATATGCT CTCCATTATG CTGTAGCGTA 
CCCGTATGAT GGGATCTACT ACGTATACGA GAGGTAATAC GACATCGCAT 

1160 1170 1180 1190 1200 

***** 

TTGCGATGCA AAGACTACAG CAGAACTTCT AGATCTTGCA CTTGCTGATA 
AACGCTACGT TTCTGATGTC GTCTTGAAGA TCTAGAACGT GAACGACTAT 

1210 1220 1230 1240 1250 

***** 

TTAATCATCA AAATTCAAGG GGATACACGG TGCTGCATGT TGCAGCCATG 
AATTAGTAGT TTTAAGTTCC CCTATGTGCC ACGACGTACA ACGTCGGTAC 

1260 1270 1280 1290 1300 

***** 

AGGAAAGAGC CTAAAATTGT AGTGTCCCTT TTAACCAAAG GAGCTAGACC 
TCCTTTCTCG GATTTTAACA TCACAGGGAA AATTGGTTTC CTCGATCTGG 

1310 1320 1330 1340 1350 

***** 

TTCTGATCTG ACATCCGATG GAAGAAAAGC ACTTCAAATC GCCAAGAGGC 
AAGACTAGAC TGTAGGCTAC CTTCTTTTCG TGAAGTTTAG CGGTTCTCCG 

1360 1370 1380 1390 1400 

***** 

TCACTAGGCT TGTGGATTTC AGTAAGTCTC CGGAGGAAGG AAAATCTGCT 
AGTGATCCGA ACACCTAAAG TCATTCAGAG GCCTCCTTCC TTTTAGACGA 

1410 1420 1430 1440 1450 

***** 

TCGAATGATC GGTTATGCAT TGAGATTCTG GAGCAAGCAG AAAGAAGAGA 
AGCTTACTAG CCAATACGTA ACTCTAAGAC CTCGTTCGTC TTTCTTCTCT 

1460 1470 1480 1490 1500 

***** 

CCCTCTGCTA GGAGAAGCTT CTGTATCTCT TGCTATGGCA GGCGATGATT 
GGGAGACGAT CCTCTTCGAA GACATAGAGA ACGATACCGT CCGCTACTAA 

1510 1520 1530 1540 1550 

***** 

TGCGTATGAA GCTGTTATAC CTTGAAAATA GAGTTGGCCT GGCTAAACTC 
ACGCATACTT CGACAATATG GAACTTTTAT CTCAACCGGA CCGATTTGAG 

1560 1570 1580 1590 1600 

***** 

CTTTTTCCAA TGGAAGCTAA AGTTGCAATG GACATTGCTC AAGTTGATGG 
GAAAAAGGTT ACCTTCGATT TCAACGTTAC CTGTAACGAG TTCAACTACC 

1610 1620 1630 1640 1650 
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CACTTCTGAG TTCCCACTGG CTAGCATCGG CAAAAAGATG GCTAATGCAC 
GTGAAGACTC AAGGGTGACC GATCGTAGCC GTTTTTCTAC CGATTACGTG 

1660 1670 1680 1690 1700 

* * * * * 

AGAGGACAAC AGTAGATTTG AACGAGGCTC CTTTCAAGAT AAAAGAGGAG 
TCTCCTGTTG TCATCTAAAC TTGCTCCGAG GAAAGTTCTA TTTTCTCCTC 

1710 1720 1730 1740 1750 

***** 

CACTTGAATC GGCTTAGAGC ACTCTCTAGA ACTGTAGAAC TTGGAAAACG 
GTGAACTTAG CCGAATCTCG TGAGAGATCT TGACATCTTG AACCTTTTGC 

1760 1770 1780 1790 1800 

***** 

CTTCTTTCCA CGTTGTTCAG AAGTTCTAAA TAAGATCATG GATGCTGATG 
GAAGAAAGGT GCAACAAGTC TTCAAGATTT ATTCTAGTAC CTACGACTAC 

1810 1820 1830 1840 1850 

.* * * * * 

AC TTGTCTG A GATAGCTTAC ATGGGGAATG ATACGGCAGA AGAGCGTCAA 
TGAACAGACT CTATCGAATG TACCCCTTAC TATGCCGTCT TCTCGCAGTT 

1860 1870 1880 1890 1900 

***** 

CTGAAGAAGC AAAGGTACAT GGAACTTCAA GAAATTCTGA CTAAAGCATT 
GACTTCTTCG TTTCCATGTA CCTTGAAGTT CTTTAAGACT GATTTCGTAA 

1910 1920 1930 1940 1950 

***** 

CACTGAGGAT AAAGAAGAAT ATGATAAGAC TAACAACATC TCCTCATCTT 
GTGACTCCTA TTTCTTCTTA TACTATTCTG ATTGTTGTAG AGGAGTAGAA 

1960 1970 1980 1990 2000 

***** 

GTTCCTCTAC ATCTAAGGGA GTAGATAAGC CCAATAAGCT CCCTTTTAGG 
CAAGGAGATG TAGATTCCCT CATCTATTCG GGTTATTCGA GGGAAAATCC 

2010 2020 2030 2040 2050 

***** 

AAATAGGTAA TTGTATTAGG ATATATGAGG AAGAAGAGGA TTTTCTTGTA 
TTTATCCATT AACATAATCC TATATACTCC TTCTTCTCCT AAAAGAACAT 

2060 2070 2080 2090 2100 

***** 

ACATAGCACT CTTTCCTTTC ATCATTTGAT ATGTCAACAT ACATACAACA 
TGTATCGTGA GAAAGGAAAG TAGTAAACTA TACAGTTGTA TGTATGTTGT 

2110 2120 2130 2140 2150 

***** 

GCTGTACCAT AAACTTGTAT TGTTGCACTT ACAACTTTGA AGAACAGAAT 
CGACATGGTA TTTGAACATA ACAACGTGAA TGTTGAAACT TCTTGTCTTA 



2160 



2170 
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TTATTTGAAA AAAAAAAAAA AA 
AATAAACTTT TTTTTTTTTT TT 
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50 

***** 

MDNSRTAFSDSND I SGSSS I CCIGGGMTEFFS PETS PAE IT SLKRLSETL 

100 

* * * » • 

ES I FDASLP EFDYF ABAKLWSG PCKE I PVHRC ILSARS PFFKNLFCGKK 

150 

***** 

EKNSSKVELKEVMKEHEVSYDAVMSVLAYLYSGKVRPS PKDVCVCVDNDC 

200 

***** 

SHVACRPAVAFLVEVLYTSFTFQI SELVDKFQRHLLDILDKTAADDVMMV 

250 

***** 

LSVANICGKACERLLSSCIEIIVKSNVDIITLDKALPHDIVKQITDSRAE 

300 

***** 

LGLQGPESNGFPDKHVKRIHRALDSDDVELLQMLLREGHTTLDDAYALHY 

350 

***** 

AVAYCDAKTTAELLDLALADINHQNSRGYTVLHVAAMRKEPKIWSLLTK 

400 

***** 

GARPSDLTSDGRKALQIAKRLTRLVDFSKSPEEGKSASNDRLCIEILEQA 

450 

***** 

ERRDPLI^EASVSLAMAGDDLRMKLLYLENRVGLAKLLFPMEAKVAMDIA 

500 

***** 
QVIX3TSEFPIASIGKKMANAQRTTVDLNEAPFKIKEEHLNRLRALSRTVE 

550 

***** 

LG KRF F PRC S EVLNK I MDADD LS E I A YMGNDTAE ERQ LKKQ RYMELQ E I L 



TKAFTEDKEEYDKTNNISSSCSSTSKGVDKPNKLPFRK 
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Dosage effect of NPR1 on Psm ES4326 resistance 
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Dosage effect of NPR1 on growth of P. parasitica 



c 
a 



m 
o> 

w 
O 

x: 
o. 
o 



o 
o 



03 

E 

3 
Z 



1000 



800 h 



600 - 



400 



200 !- 




WO 98/06748 



PCT/US97/13994 



32/34 



F/6. 9C 




0 I 1 » * 1 

Qtseass rating 



WO 98/06748 



PCT7US97/13994 



33/34 



F/f. to 



MS MS-INA 



WO 98/06748 



PCT/US97/13994 



34/34 



Fi&5- It A- 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/t3994 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC(6) : C07K 14/00; C07H 21/04; A01H t/00; C12N 5/00, 15/00 
US CL : 435/410; 530/350; 536/23.2; 800/205 
According to International Patent Classification (IPC) or to both national classification and IPC 


a FIELDS SEARCHED 


Minimum documentation searched (classification system followed by classification symbols) 

U.S. : 


Documentation searched other than minimum docu men taboo to the extent that such documents are included in the fields searched 


Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
APS. DIALOO 


C. DOCUMENTS CONSIDERED TO BE RELEVANT 


Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


Y 


CAO et al. Characterization of an Arabidopsis Mutant That is 
Nonresponsive to Inducers of Systemic Acquired Resistance. The 
Plant Cell. November 1994. Vol. 6, pages 1583- 1592, see entire 
article 


1-7, 13-25, 31-37 


Y 


KRASTANOVA et al. Transformation of Grapevine Rootstocks 
with the Coat Protein Gene of Grapevine Fan leaf Nepovirus. Plant 
Cell Reports. June 1995. Vol. 14 t No. 9, pages 550-554, see entire 
article. 


1-2, 6-7, 13-25, 
31-32, 35-36 


Y 


US 5,304,730 A (LAWSON et al.) 
document. 


19 April 1994, see entire 


1-2, 6-7, 13-25, 
31-32, 35-36 


| x| Further documents are listed in the continuation of Box C. | | See patent family annex. 


* Special cauteries of cited documnm: 

"A* doemtw* dafmicg the |inwil ea— of the art which it not considered 
to be of pertkiiJar ralwnct 

*B" eartser document pofatitbod oo or after the intaraackm) filing date 

"L" document vhkb Bey (brow deubo on priority cleim(a) or which n 
coed lo tr^pk^tfr the pobbcabos date of another citation or other 
tpeciel raw (as apecs/ied) 

*0* document referring to is oral dttcbsnre. um. csJnbibon or other 

"P* document pubnthed prior to th* interostianel filing den but briar thaa 
tbo pnorrty data claimed 


a T" tatar document pabbihed after tin nneraauonsj filing date or priority 
data and do* ic conflict with the application but cited to understand 
the principle or theory onderryistg the invention 

"X* document of particular relevance; tba claimed invention cannot be 
considered nova) or cannot be coraidered to mrohre an urv cntne step 
when the doctnoeet m takan alone 

*Y* document of particular relevance; the claimed invrauon cannot be 
considered to inrolvs to Brtanuro step wheo the document is 
combined with one or more other such documents, such combination 
being obvious lo a person skilled in (he ert 

"A" document member of the tene patent family 


Date of the actual completion of the international search 
16 SEPTEMBER 1997 


Date of mailing of the international search report 

0 9 JAN «58 


Name and mailing address of the ISA/US 
Coramuxioncr of Patents and Trademark* 
Box PCT 

Washington. D C. 20231 
Facsimile No. f703) 305-3230 


Authorized officer — r— ? / 

PHUONO BUI 
Telephone No. (703) 308-0 196 



Form PCT/1SA/210 (second sbcctXJuly 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/I3994 



1 



C (Contimiatioa). DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


Y 


ZHANG ct al. Expression of Antisense or Sense RNA of an 
Ankyrin Repeat Containing Gene Blocks Chloroplast 
Differentiation in Arabidopsis. The Plant Cell. December 1992, 
Vol. 4, No. 12, pages 1575-1588. 


3 



Form PCT7ISA/210 (continuation of second ihcctXJ jty 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/13994 



Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This international report has not been established in respect of certain claims under Article 17(2X») for the following reasons: 
1. I J Claims Nos.: 

"* because they relate to subject matter not required to be searched by this Authority, namely: 



a 



Claims Nos.: 8-12,26-50.38-41 

because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out specifically: 



These claims are directed to SEQ ID NO'S. Since no computer-readable form of the disclosed sequences was 
submitted, these claims could not be searched. 

3. Q Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of Item X of first sheet) 



This International Searching Authority found multiple invention! in this international application, as follows: 



1 . | [ As all required additional search fees were timely paid by the applicant, this international search report covers all searchable 



claims, 



2. | j As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 

| | As only some of the required additional search fees were timely paid by the applicant, this international search report covers 
only those claims for which fees were paid, specifically claims Nos.: 



4. | J No required additional search fees were timely paid by the applicant Consequently, this international search report is 
restricted to the invention first mentioned in the claims: it is covered by claims Nos.: 



Remark on Protest | | The additional search fees were accompanied by the applicant's protest 

| ) No protest accompanied the payment of additional search fees. 



Form PCT/1SA/2I0 (continuation of first shcet(!)XJuiy 1992)* 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 
FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 



^GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 



IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



BEST AVAILABLE IMAGES 




COLOR OR BLACK AND WHITE PHOTOGRAPHS 



